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A.  Objective:  The  goal  of  this  research  is  to  develop  new  and  better  digital  communi¬ 
cation  systems  using  wavelets  and  multirate  filterbanks,  which  include  new  source  and 
channel  codings  for  antijamming  applications. 

B.  Main  Research  Accomplishments:  We  have  made  several  research  accomplish¬ 
ments  as  follows. 

(i) .  Malvar  Wavelets  on  Arbitrary  Shapes.  We  systematically  constructed  two  dimen¬ 
sional  Malvar  wavelets  defined  on  L-shaped  regions,  which  can  be  used  to  construct 
two  dimensional  Malvar  wavelets  on  arbitrary  shapes.  The  application  of  such  wavelets 
include  the  discrete  cosine  transform  (DCT)  image  coding  for  arbitrary  shaped  objects 
such  as  the  emerged  video  coding  standard  MPEG4.  The  two  dimensional  Malvar 
wavelets  may  be  used  the  eliminate  the  block  effects  produced  by  the  DCT  compres¬ 
sion  at  high  compression  ratios. 

(ii) .  Ambiguity  Resistant  Precoders  for  ISI  Mitigation  Without  Channel  Information. 
We  developed  new  precoding  schemes  using  multirate  filterbanks,  which  are  called 
ambiguity  resistant  precoders  (ARP).  With  ARP,  the  ISI  channel  information  is  not 
necessary  for  the  transmitter  or  the  receiver  to  recover  the  information.  We  character¬ 
ized  all  ARP,  which  are  some  special  families  of  matrix  polynomials.  Any  ARP  can  be 
used  to  resist  ISI,  to  also  resist  additive  random  errors  optimal  ARP  were  studied  and 
characterized. 

(iii) .  A  New  System  Identification  Method.  New  channel  identification  using  chirp 
signals  and  joint  time-frequency  analysis  and  synthesis  was  proposed,  where  the  per¬ 
formance  is  superior  to  the  conventional  method  at  low  SNR. 

(iv) .  Multiwavelet  Transforms.  A  new  prefiltering  for  discrete  multiwavelet  transforms, 
which  has  better  energy  compaction  than  other  prefiltering,  was  obtained. 

(v) .  A  family  of  new  pulse  shaping  filters  that  are  ISI  free  with  the  matched  filtering 
and  also  ISI  free  wothout  the  matched  filtering,  was  obtained,  which  has  been  recently 
extended  by  numerous  researchers. 

(vi) .  A  quantatitive  SNR  analysis  for  joint  time-frequency  analysis  by  introducing 
3dB  SNR  definition  in  the  joint  time-frequency  plane  was  obtained. 

(vii) .  A  frequency  estimation  method  from  the  undersampled  data  with  multiple 
frequencies  was  proposed. 

(viii).  An  optimal  multiple  pulse  repetition  frequency  (PRF)  design  method  was 
proposed. 
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(ix).  A  new  wavelet  based  watermarking  was  proposed. 

C.  Significance:  The  results  obtained  through  this  project  have  advanced  digital  signal 
processing  and  its  applications,  in  particular  wavelets  and  filterbanks,  in  telecommu¬ 
nications,  multimedia  systems,  and  radar  applications  with  some  jamming  resistance 
properties. 
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Multiple  frequency  detection  in 
undersampled  complex-valued  waveforms 
with  close  multiple  frequencies 

Guangcai  Zhou  and  Xiang-Gen  Xia 


Indexing  terms:  Signal  processing.  Frequency  measurement 

The  determination  of  multiple  frequencies  in  undersampled 
waveforms  is  studied,  where  the  multiple  frequencies  are  close  to 
each  other.  Given  multiple  undersampling  rates,  the  maximal 
range  of  the  detectable  multiple  frequencies  is  the  least  common 
multiple  of  these  multiple  rates  under  the  assumption  that  these 
rates  are  larger  than  twice  the  maximal  distance  between  the 
multiple  frequencies. 

Introduction  A  traditional  method  to  detect  the  frequencies  of  a 
multiple  frequency  waveform  x(t)  is  to  sample  x (/)  at  the  Nyquist 
sampling  rate  and  then  to  implement  the  discrete  Fourier  trans¬ 
form  (DFT).  The  peaks  in  the  DFT  domain  provide  the  frequen¬ 
cies  of  this  waveform.  However,  when  the  maximal  frequency  in 
the  waveform  is  very  large,  the  sampling  rate  also  needs  to  be  very 
large.  Recently,  methods  to  detect  a  single  frequency  in  an  under- 
sampled  real -valued  waveform  have  appeared  [1  -  4].  A  single  fre¬ 
quency  in  a  real-valued  waveform  actually  corresponds  to  two 
symmetric  frequencies  in  a  complex-valued  waveform,  where  their 
coefTicients  are  complex  conjugates  of  each  other.  Most  recently, 
general  multiple  frequency  detection  in  undersampled  complex¬ 
valued  waveforms  is  studied  in  [5].  Given  L  undersampling  rates 
of  a  complex-valued  waveform  with  K  multiple  frequencies  with  L 
>  r\H  a  range  for  the  detectable  frequencies  is  given  in  [5],  which 
is  the  minimum  of  the  least  common  multiples  of  all  possible  rj 
difTerent  rates  in  the  L  undersampling  rates.  As  mentioned  in  [5], 
this  range  may  not  be  optimal,  in  particular  when  some  prior 
information  on  the  multiple  frequencies  is  known.  It  should  be 
noted  that,  the  larger  the  range  of  the  detectable  multiple  frequen¬ 
cies,  the  better  the  sampling  efficiency,  when  the  multiple  sampling 
rates  are  given.  The  main  idea  in  the  above  method  is  the  imple¬ 
mentation  of  multiple  DFTs  of  the  undersampled  waveforms  and 
each  DFT  gives  a  set  of  residues  of  the  multiple  frequencies  mod¬ 
ulo  the  sampling  rates.  Then,  the  (generalised)  Chinese  remainder 
theorem  (CRT)  is  used  in  the  determination  of  these  multiple  fre¬ 
quencies. 

In  this  Letter,  we  study  the  case  when  the  multiple  frequencies 
in  a  complex-valued  waveform  are  assumed  in  a  priori  to  be  close 
to  each  other  within  a  distance  W.  Given  L  undersampling  rates, 
each  of  wliich  is  >  2  W%  we  prove  that  the  maximal  range  for  the 
detectable  multiple  frequencies  is  the  least  common  multiple  (1cm) 
of  these  sampling  rates. 

Uniqueness  of  multiple  frequency  determination:  Without  loss  of 
generality,  the  multiple  frequencies  in  a  waveform  x(t)  are 

assumed  /,  / . fK  with  /</><...<  fK.  The  complex-valued 

waveform  x(r)  is  represented  by 

x(<)  =  y\-Ue2*jAl 
*=1 

where  Ak,  k  =  1 .  Af,  are  nonzero  coefficients.  Let  m,  be  one 


=  =ytA^J,inim'  "eZ  (1> 

We  first  consider  the  single  frequency  case.  In  this  case  xU)  = 
A.tr*  v  and  xmt[n]  =  A  e ~l.  Let  /  =  /i,w,  +  r„  0  <  r,  <  w,  -  1. 
i.e.  r,  =  /  (mod /»,).  then  the  m,  point  DFT  of  0  <  n  <  m,  - 
l.  is 

DFTmMm.  M)  =  .-MU--r|)  0  <  A*  <  in i  -  1  (2) 

This  means  that  the  residue  r,  =  f  mod  mt  can  be  detected  from 
the  m,  point  DFT  of  xm,l  [wj.  Now .  let  m: . mL  be  other  under¬ 
sampled  rates  and  lem !m,.  m: . mL  \  be  the  1cm  of  the  integers  in 

the  set  {m; . mL\.  By  the  CRT  we  have 

Lenvna  I:  If  /,  <  kmjm,.  nu . mL\.  then  f  can  be  uniquely 

determined  from  the  residues  r,  =  f  (mod  m,)  using  the  above 
undersampled  DFTs. 

We  now  consider  the  multiple  frequency  case.  Let  rk{  -  f  mod 
mh  Then,  the  ml  point  DFT  of  xmf[n]%  0<n<m,~  I  is 
K 

DFTm, (xm,  [n])  =  52  0  <  i  <  m,  -  1 

1  1  <  l  <  L  (3) 

From  this  representation,  we  see  that,  without  the  knowledge  of 
amplitudes  (it  is  usually  impossible,  for  example,  if  all  of  the 
nonzero  amplitudes  are  equal),  generally  we  cannot  match  the 
peaks  and  the  residues  precisely  (see  example  in  Section  3).  How¬ 
ever,  if  we  know  that  ru  is  the  residue  of  fk  modulo  m,  for  /  =  1,  2, 
....  L,  i.e.  rkl  =  fk  mod  m{  and  fk  is  upper  bounded  by  lcm{mj, 

mL},  then,  by  Lemma  l,  we  can  determine  the  frequencies / . fK 

uniquely.  In  conclusion,  we  have  the  following  lemma. 

Lemma  2:  If  max{/ . fk }  <  lcm{m . . .  and  we  know  the 

residue  rkl  of  fk  modulo  mt  precisely  for  1  <  /  <  L  and  1  <  k  £  K% 
then  we  can  determine/  uniquely. 

If  we  only  know  that  the  set  {rkh  k  -  1,  ... K },  is  the  set  of  all 

residues  of/,  1  <  k  <  K  modulo  m,  for  /  =  1,  2 . L,  in  general  it 

is  impossible  to  determine  which  one  in  the  set  is  the  residue  of/ 
and  therefore  in  general  it  is  impossible  to  determine  the  multiple 
frequencies  /.  It  is  possible,  however,  if  some  prior  information 
about  the  multiple  frequencies /  is  known,  which  is  the  goal  of  the 
rest  of  this  Letter. 

Without  loss  of  generality,  we  may  assume  that/  </  <  ...  <fK ■ 
Let  W  =  m3xlili/iK\f,-f)  =/*  -/.  The  undersampling  rates  are 

chosen  m, . mL  such  that  . mL]  >  2W.  Then  we  have 

the  following  lemma. 

Lemma  3:  Under  the  above  condition  on  m(  and  /,  1  <  /  <  L,  1  < 
k  <  JC  the  residue  ru  of /  modulo  m,  is  uniquely  determined  by  the 
set  {rlh  ....  r*,}  for  1  <  /  <  L  and  1  <  k  <  K. 

Proof:  Since  min{m, . mL}  >  2  W%  max{[/  -/)}  <  W  and/ 

are  all  distinct,  it  is  not  hard  to  see  that  the  elements  in  the  set  {rw, 
....  rgj}  are  difTerent  from  each  other.  Let  {r,„  ....  =  {a,, a*} 

with  a,  <  Oj  <  ...  <  a*.  By  the  following  representation 

r  fi  =  ru  4  ki  mi 
f2  =  T2/  4  koTTli 

i  .  (4) 

L  / k  =  rh  i  +  kf<mi 

and  the  facts  that  fK~f<  W and  m,  >  2 W,  we  have  k^<k2< 

<  kK  and  kK  -  kx  =  0  or  1.  If  kK  =  ku  then  ru  <  r2/  <  ...  <  r„  and  rw 

-  ru  <  W.  If  k}  =k,  =  ...  =  kn  =  ...  =  kK  =  kx  +  1  for  some  /  € 
{1,2,  K-  1},  then: 

f\  =  ru  +  klml . /  =  ru  4  (5) 

and 

Z+i  =  r(l+1)/  4  (ki  4  l)m/ . fs  =  rKi  4  ( ki  4  l)m/  (6) 

By  eqn.  5,  we  have  rw  <  r2/  <  ...  <  r,f  and  rd  -  rw  <  W.  By  eqn.  6, 
we  have  <  r((.2w  <  ...  <  r„  and  ra  -  <,  W.  Furthermore  rw 

-  ru  =  mt  -  (fK  -/)  >  W.  Hence,  we  can  determine  the  values  of 
rlh  ...,  ru  uniquely  according  to  the  following  law:  if  a*  -  dj  <  W \ 

then  we  have  r„  =  a, . ru  =  a^.  If  a,.,  -  a,  >  W \  for  some  i,  1  £ 

i  <  K—  1  then  rit  —  ....  r[k_Uf  =  r{K_„xu  —  cx, ,  ..i,  —  ex,. 


r~  t  rr*‘rr>ryhnr*C'  t  r~~r~rr~  o  n  f  t.  *  r\r>-r  \/„  I  oo  h  I  -  t  cr 


Therelorc.  in  all  cases  we  can  determine  me  resiJues  unique  i> 

Combining  the  above  lemmas,  we  have  obtained  the  following 
main  result.  — , 

Theorem  T  Assume  complex  valued  waveform  .\1f)  contains  K  dif¬ 
ferent  frequencies  /,  for  I  <  k  <  A  Let  w,.  I  <  /  <  L  be  sampling 
rates  in  the  undersampled  versions  v.,J«]  of  xU)  in  eqn.  I  with  m, 
replaced  by  m,.  \  <  I  <  L.  Then  the  K  frequencies  /  for  I  <  k  <  A 
can  be  uniquely  determined  by  using  the  m,  point  DFT  of  v,j/i] 

for  1  <  l<  L  if  max’/, . fK\  <  lem |w . "hi  and  min',m . 

mt|  >  2max,«  ,s ,, »/  -  fi¬ 
ll  is  clear  that  the  range  of  lcmjm, . wc|.  Pven  m,.  ....  m, 

is  the  maximal  one.  The  difference  between  the  above  result  and 

the  result  in  (5)  is  that  the  knowledge  of  minim, . m, )  >  2max, . 

'  -J}  is  needed  in  this  Letter,  while  no  knowledge  is  neces¬ 

sary  in  the  result  in  [5]  as  mentioned  in  the  introduction. 

Multiple  frequency  determination  algorithm:  For  simplicity,  we 
assume  m, . mL  are  pairwise  coprimes.  We  assume  the  condi¬ 

tions  in  theorem  1  hold.  Now.  we  give  the  concrete  determination 
algorithm  as  follows: 


Step  1.  Sample  the  waveform  Mr)  with  the  sampling  rates  m,  to 
obtain  .vjn]  for  I  <  /  <  L. 

Step  2.  Implement  the  m,  point  DFT  of  xjn],  0  <n<m,-  1.  to 
detect  the  set  S,  =  {ct„, ....  a„}  of  K  peaks  in  the  DFT  domain  for 
1  <I<L. 

Step  3.  For  each  peak  set  {a . a*}  =  S,  with  a,  <  a  <  ...  < 

if  a*  -  a,  <  W.  then  we  have  r„  =  a, . ru  =  a*.  If  a,.,  -  Oi  >  " 

and  a,  -  a,  <  W'.  for  some  /,  1  <  i  £  K  -  1  then  ru  -  a,. . 

=  a  r  ,  =  a,  r„  =  a.  Hence  we  can  determine  these  resi- 

dues  r„  of /,  uniquely. 


Step  4.  By  Step  3  we  know: 


fk  =  fkl 

(mod  mi) 

k  =  1 . A' 

/  =  1 . L 

(-) 

We  define: 

i  and  Mt  = 

M 

mi 

(8) 

Since  m  and  M  are  coprime. 

there  are  solutions  N,  of 

St  Ml  -- 

=  1  (mod  mi) 

(9) 

With  these  S.. 

the  solution  fk 

is 

fk  =  7-Ai.Vi.Wi  +  •  •  * 

+  nti-VtA/t 

(mod  M) 

(10) 

The  above  closed  formulas  for /,  are  the  solutions  of/,.  For  the 
CRT  involved  in  step  4.  see.  for  example.  [7]. 

Examples  In  this  Section,  we  see  a  simple  example.  Consider  a 
signal  with  three  frequencies,  where  their  differences  are  at  most 
10 Hz.  le  W  =  10.  We  sample  this  signal  with  frequences  m,  - 
■*7  Hz.  m.  =  28  Hz.  m,  =  29  Hz.  Hence  if  the  largest  frequency  of 
this  signal  is  less  than  27  x  28  x  29  =  21924Hz,  by  theorem  1.  we 
can  uniquely  determine  these  three  frequencies.  Let  fx  —  20008  Hz, 
/.  =  20013Hz.  and/,  =  20017Hz."  * 

For  the  sampling  rate  m,  =  27  Hz,  we  obtain  the  DFT  peak  val¬ 
ues  at  {1.6.  10}.  For  the  sampling  rate  m,  =  28Hz.  we  obtain  the 
DFT  peak  values  at  {16,  21,  2fj}}  For  the  sampling  rate  m,  = 
29Hz.  we  obtain  the  DFT  peak  {/allies  at.{3,  7.  27}.  Then,  from 
step  3  we  have  ] 

ru  =  1  r2i  =6  r3i  =  10 

rn  =  16  7*22  =f£1  7*32=25 

7*13  =  27  r23  =  3  r33.=  t 

By  step  4,  the  multiple  frequencies  £an  be  determined. 

If  we  take  another  sample  wit£  rate  n\  =  31,  then  the  signal 
with  highest  frequency  <  679  644  H^  can  be  uniquely  determined. 

Conclusion:  In  this  Letter,  we  studied  the  detection  of  multiple  fre¬ 
quencies  in  undersampled  complex*- valuecf  waveforms,  where  the 
multiple  frequencies  are  close  to  each  other.  Given  the  undersam¬ 
pling  rates,  the  maximal  range  of  the  detectable  multiple  frequen¬ 
cies  was  given,  that  is  the  1cm  of  the  undersampling  rates,  when  all 
these  rates  are  larger  than  twice  the  maximal  distance  between  the 
multiple  frequencies.  The  main  advantage  of  undersampling  is  the 
hardware  cost  reduction  [6]. 
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Performance  of  adaptive  muftisensor 
decision  feedback  equaliser  for  time-varying 
frequency  selective  radio  channels 

S.  Buljore,  J.F.  Diouris  and  J.  Saillard 


Indexing  terms:  Adaptive  equalisers.  Radio  applications 

A  multisensor  decision  feedback  equaliser  based  on  the  minimum 
mean  squared  error  (MMSE)  criterion  is  studied.  The  superiority 
of  the  performance  of  the  multisensor  equaliser  is  shown  by 
simulation  of  a  whole  communication  system  in  which  the 
adaptive  equaliser  is  incorporated.  The  recursive  least  squares 
(RLS)  algorithm  is  used  to  update  the  coefficients.  From  the 
results  obtained  for  a  time-varying  urban  terrain  channel  model, 
the  extremely  interesting  tracking  capability  of  the  multisensor 
equaliser  is  shown. 

Introduction :  Frequency  selective  and  time-varying  radio  channels 
considerably  degrade  the  performance  of  OSM-type  mobile  digital 
communication  systems  [1].  In  addition  to  equalisation,  the  reduc¬ 
tion  in  the  variation  of  the  signal  to  noise  ratio  (SNR)  due  to  time 
variation  of  the  radio  channels  at  the  receiver  using  diversity  tech¬ 
niques  is  a  desirable  asset .  This  Letter  presents  a  multisensor  deci¬ 
sion  feedback  equaliser  (DFE)  implemented  with  the  recursive 
least  squares  algorithm  for  a  time-varying  typical  urban  channel. 
A  Monte  Carlo  simulation  of  the  digital  link  incorporating  the 
multisensor  equaliser  is  carried  out.  The  channel  model  considered 
is  given  by  the  GSM  recommendations  for  a  typical  urban  (TU) 
environment  [2].  The  superiority  of  the  performance  of  the  multi- 
sensor  equaliser  to  combat  intersymbol  interference  (ISI)  and  fad¬ 
ing  is  shown.  Finally,  conclusions  on  the  improved  performance 
and  the  very  interesting  tracking  ability  of  the  multisensor  DFE 
are  drawn. 

System  and  channel  model:  The  scheme  of  the  baseband  system 
simulation  is  illustrated  in  Fig.  1.  The  bit  sequences  are  transmit¬ 
ted  using  a  DQPSK  modulation  scheme  through  a  square  root 
raised  cosine  filter.  The  channel  model  used  is  the  typical  urban 
channel  given  in  Table  1.  The  receiver  consists  of  another  square 
root  raised  cosine  filter,  an  adaptive  single/multi-sensor  equaliser. 
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System  Identification  Using  Chirp  Signals  and  Time- 
Variant  Filters  in  the  Joint  Time-Frequency  Domain 

Xiang-Gen  Xia,  Member ,  IEEE 


Abstract — In  this  paper,  we  propose  a  novel  method  to  iden¬ 
tify  an  unknown  linear  time  invariant  (LTI)  system  in  low 
signal-to-noise  ratio  (SNR)  environment.  The  method  is  based 
on  transmitting  chirp  signals  for  the  transmitter  and  using  linear 
time-variant  filters  in  the  joint  time-frequency  (TF)  domain  for 
the  receiver  to  reduce  noise  before  identification.  Due  to  the  TF 
localization  property  of  chirp  signals,  a  large  amount  of  additive 
white  noise  can  be  reduced,  and  therefore,  SNR  before  iden¬ 
tification  can  be  significantly  increased.  This,  however,  cannot 
be  achieved  in  the  conventional  methods,  where  pseudo-random 
signals  are  used,  and  therefore,  noise  reduction  techniques  do  not 
apply.  Our  simulation  results  indicate  that  the  method  proposed 
in  this  paper  outperforms  the  conventional  methods  significantly 
in  low  SNR  environment  This  paper  provides  a  good  application 
of  time-frequency  analysis  and  synthesis. 

I.  Introduction 

THE  SYSTEM  identification  problem  is  a  classical  and 
important  problem  in  signal  processing,  which  has  appli¬ 
cations  in  many  fields  including  channel  estimation  in  wireless 
communications.  There  have  been  extensive  studies  on  this 
problem;  see,  for  example,  [2],  [3],  [28],  [31],  and  [32].  The 
problem  can  be  stated  as 

y[n]  =  T  Mn  k]x[k]  +  v[n )  (1.1) 

k 


where 


x[k]  transmitted  signal; 

/i[n]  impulse  response  of  a  linear  time  invariant  (LTI) 
system  (or  channel); 
u[n]  additive  noise; 
y[n]  received  signal. 

The  problem  is  to  identify  the  LTI  system  transfer  function 
H{uj)  of  h[n]  given  the  input  and  the  output  signals  x[n ]  and 

yN- 

The  conventional  method  for  solving  the  above  problem  is 
the  least-squared  solution  method  that  is  equal  to  the  cross- 
spectral  method  in  stationary  cases,  i.e.,  the  system  transfer 
function  H( uj)  can  be  estimated  by 


H{»)  = 


Sxyjuj) 

SXX{v) 


0-2) 
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where  Sxy(uj)  is  the  cross-spectrum  of  x[n]  and  y[n],  and 
Sxx( v)  is  the  autospectrum  of  x[n].  When  the  additive  noise 
v[n]  in  (1.1)  is  a  zero-mean  Gaussian  process  and  statistically 
independent  of  the  input  signal  x[n],  the  estimate  in  (1.2)  is 
asymptotically  unbiased,  and  its  error  variance  approaches  the 
Cramer-Rao  lower  bound  that  is  proportional  to  the  variance 
of  the  additive  noise  v[n].  Clearly,  the  performance  is  limited 
by  this  noise  variance,  or  the  signal-to-noise  ratio  (SNR). 
When  this  SNR  is  low,  the  performance  of  the  estimate  in 
(1.2)  is  poor.  Since  the  autospectrum  of  the  input  signal  x[n] 
is  in  the  denominator  in  the  estimate  (1.2),  the  input  signal 
is,  in  general,  chosen  as  a  pseudo-random  signal  with  flat 
spectrum  [4].  With  these  kinds  of  input  signals,  noise  reduction 
techniques  before  system  identification  do  not  apply.  As  a 
matter  of  fact,  any  traditional  noise  reduction  technique,  such 
as  any  Fourier  transform  technique,  does  not  perform  well 
for  wideband  signals.  This  implies  that  it  is  not  possible  to 
increase  the  SNR  or  the  performance  of  the  estimate  (L2)  by 
transmitting  a  pseudo  random  signal  and  using  the  conven¬ 
tional  Fourier  noise  reduction  techniques.  Several  questions 
arise  here: 

i)  Can  we  transmit  other  wideband  signals,  such  as  chirp 
signals,  instead  of  pseudo  random  signals? 

ii)  If  so,  can  we  take  the  advantage  of  these  wideband 
signals  and  reduce  the  noise  v[n]  in  (1.1)? 

iii)  If  so,  can  we  improve  the  performance  of  the  estimate 
(1.2)  after  denoising? 

The  aim  of  this  paper  is  to  positively  answer  these  questions. 
The  main  idea  is  the  following.  Chirp-type  signals  are  trans¬ 
mitted,  which  have  wideband  characteristics  in  the  frequency 
domain  but  concentrate  in  the  joint  time-frequency  domain. 
Chirp-type  signals  are  used  quite  often,  such  as  in  radar  and 
in  FM  in  communications  systems.  The  TF  concentration 
property  usually  holds  after  an  LTI  system  (this  will  be  seen 
later).  Since  a  joint  TF  distribution  usually  spreads  noises  and 
localizes  signals,  in  particular  chirps,  the  receiver  may  use  a 
TF  analysis  technique  (see,  for  example,  [5]— [27])  to  map  the 
received  signal  y[n\  from  the  time  domain  into  the  joint  time- 
frequency  domain.  In  this  way,  the  SNR  can  be  significantly 
increased  in  the  joint  TF  domain,  and  the  receiver  may  be 
able  to  see  patterns  in  the  joint  TF  plane  and  therefore  reduce 
the  noise  by  filtering  in  the  joint  TF  domain.  This  filtering 
is  basically  a  time-variant  filtering.  We  use  this  name  in  the 
rest  of  this  paper.  The  model  (1.1)  after  a  time-variant  filter 
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becomes 

y[ri\  =  h[n  -  k]x[k]'+  v[n]  (1.3) 

k 

where  v[n\  is  the  new  noise  after  the  filtering. 

The  time-variant  filter  used  in  this  paper  is  based  on  the 
discrete  Gabor  transforms,  which  was  studied  in  [5]-[7].  For 
chirp-type  signals,  about  13  dB  SNR  is  increased  consistently 
with  this  filter  in  [6].  When  the  original  SNR  in  (1.1)  is  not 
too  low,  say,  for  example,  above  —  1  dB,  the  new  SNR  in  (1.3) 
may  reach  a  significant  high  level  so  that  the  estimate  of  H{u) 
from  y[n]  and  x[n\  is  accurate  enough  for  many  applications. 
In  this  paper,  both  denoising  with  several  mask  design  methods 
and  system  identification  simulations  are  performed.  These 
simulations  show  that  a  much  better  performance  over  the 
conventional  method  can  be  achieved. 

It  should  be  pointed  out  that  the  optimal  training  signal 
design  for  dynamic  system  identification  has  a  long  history 
dating  back  over  20  years.  The  design  methods  are  traditionally 
based  on  the  minimization  of  the  Cramer-Rao  bound  for 
the  system  parameter  estimation  in  either  the  time  or  the 
frequency  domain  (see,  for  example,  [28]— [32])  but  not  in 
the  joint  TF  domain.  The  aim  of  this  paper  is,  however,  not 
focused  on  the  optimal  training  signal  design,  although  it  is 
a  very  interesting  topic.  Denoising  before  identification  using 
nonredundant  discrete  wavelet  transform  was  studied  in  [33] 
for  chemical  process  control  applications. 

This  paper  is  organized  as  follows.  In  Section  II,  we  briefly 
review  discrete  Gabor  transforms  and  the  iterative  time-variant 
filtering  studied  in  [5]-[7].  In  Section  m,  we  use  the  time- 
variant  filter  studied  in  Section  II  to  reduce  additive  white 
Gaussian  noise  for  a  received  signal.  The  filtering  problem  in 
this  paper  has  its  own  characteristics  due  to  the  fact  that  the 
transmitter  and  the  receiver  know  the  transmitted  chirp  signal 
x[n),  and  therefore,  its  TF  information  is  known  a  priori.  This 
TF  information  can  be  used  in  designing  a  mask  in  the  time- 
variant  filtering.  In  Section  IV,  we  utilize  the  conventional 
system  identification  method,  i.e.,  the  cross-spectral  method 
(1.2),  after  the  denoising  in  Section  HI.  In  Section  V,  we 
conclude  this  paper  by  addressing  some  possibilities  for  further 
improvements. 

n.  Discrete  Gabor  Transform 
and  Time- Variant  Filtering 

There  have  been  many  TF  analysis  techniques,  such  as 
Wigner-Ville  distributions  in  the  Cohen’s  class,  spectrogram 
(short-time  Fourier  transform  or  Gabor  transform  or  DFT 
filterbanks),  and  scalogram  (wavelets)  (see,  for  example,  [5], 
[23]— [27]).  Some  of  them,  such  as  bilinear  TF  distributions, 
have  high  resolution  but  have  crossterms  for  multicomponent 
signals.  Some  of  them,  such  as  linear  techniques  (for  exam¬ 
ple,  Gabor  transforms  and  wavelet  transforms),  do  not  have 
crossterms  for  multicomponent  signals  but  may  not  have  very 
high  resolutions.  Since,  in  this  paper,  we  deal  with  a  linear 
combination  (or  a  linear  system)  of  various  chirp  signals,  it  is 
important  for  a  TF  analysis  technique  not  to  have  crossterms 
while  it  should  also  have  a  good  resolution.  This  leads  us  to 


consider  Gabor  transforms.  In  this  section,  we  first  review  the 
discrete  Gabor  transforms  (DGT). 

Since  oversampled  DGT  is  more  robust  for  noise,  it  is 
usually  used  in  noise  reduction  applications.  However,  a 
disadvantage  for  oversampled  DGT  is  that  it  is  not  an  onto 
mapping.  In  other  words,  not  every  signal  S[M]  in  the  DGT 
transform  domain  corresponds  to  a  time  domain  signal  $[n] 
so  that  the  DGT  of  s[n]  is  exactly  equal  to  S[k,l].  This 
causes  problems  in  filtering  in  the  DGT  transform  domain, 
which  is  that  the  filtered  signal  in  the  DGT  transform  domain 
may  not  correspond  to  any  time  domain  signal  as  shown 
in  Fig.  1.  An  intuitive  solution  for  this  problem  is  to  take 
the  least-squared  error  (LSE)  solution  in  the  time  domain 
(see,  for  example,  [8H13]).  The  LSE,  however,  usually  does 
not  have  a  desired  TF  characteristics  in  the  DGT  transform 
domain.  When  a  signal  is  very  long,  the  computational  load 
for  the  LSE  solution  is  significantly  high  because  of  the 
inverse  matrix  computation.  Based  on  these  observations, 
an  iterative  algorithm  was  proposed  in  [5]-[7].  Conditions 
on  the  convergence,  properties  of  the  limit  signals,  and  the 
relationship  between  the  LSE  solutions  and  solutions  from 
the  iterative  algorithms  were  obtained  in  [6]  and  [7],  where 
a  significant  improvement  over  the  LSE  solution  was  also 
shown.  The  second  part  of  this  section  is  to  briefly  review 
some  of  these  results. 

A.  Discrete  Gabor  Transform 

We  first  review  some  basics  on  the  DGT,  which  is  necessary 
for  this  paper.  For  more  about  the  discrete  short-time  Fourier 
transform,  see  [14],  for  more  about  DFT  filterbanks,  see  [15], 
and  for  more  about  the  DGT,  see,  for  example,  [16]— [22].  Let  a 
signal  s[k],  a  synthesis  window  function  /i[n],  and  an  analysis 
window  function  7 [n]  be  all  periodic  with  same  period  L .  Then 


m— 1  N-l 

#]  =  E  C'm.nAm.nl*]  (2-D 

m=0  n=0 

CW,  =  f>(*bm,n[*]  (2.2) 

fc=0 

hm,n[k]  =  h[k  -  mAM}W£*Nk  (2.3) 

7m,n[fc]  =  7[*  -  mAM]W£ANk  (2.4) 


and  WL  =  exp(j27r/L),  j  =  \/-f.  The  coefficients  Cm,„  are 
called  the  DGT  of  the  signal  s[k],  and  the  representation  (2.1) 
is  called  the  inverse  DGT  (IDGT)  of  the  coefficients  Cm,„. 
One  condition  on  the  analysis  and  synthesis  window  functions 
7 [k]  and  h[k]  obtained  by  Wexler  and  Raz  is  the  identity1 

L- 1 

Y^h[k  +  mN]W[nMk^[k] 

k- 0 

=  <5[ro]%],  0  <  m  <  AN  —  1,  0  <  n  <  AM  -  1 

(2.5) 

1  If  we  take  the  inverse  discrete  Fourier  transform  with  respect  to  the 
parameter  n  at  the  both  sides,  the  system  (2.5)  is  the  same  as  the  one  obtained 
in  [14]  when  all  convolutions  are  considered  to  be  cyclic  convolutions  for 
finite  length  signals  in  [14]. 
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(a)  TF  of  noise 


(b)  TF  of  chirp 
signal 


(c)  TF  of 
signal 
+  noise 


Fig.  1.  TF  transform  illustration. 


where  AM  and  AN  are  the  time  and  the  frequency  sampling 
interval  lengthes,  and  M  and  N  are  the  numbers  of  sampling 
points  in  the  time  and  the  frequency  domains,  respectively, 
M  ■  AM  =  N  *  AN  =  L,  MN  >  L  (or  AM  AN  <  L).  The 
critical  sampling  case  is  when  M  *  N  =  AM  ■  AN  =  L.  The 
condition  (2.5)  on  window  functions  h  and  7  can  be  rewritten 
in  matrix  form  as 

Hpxl1lx\  ~  Mpxl  (2.6) 

where  the  subscript  m  x  n  means  the  m  by  n  matrix  p  = 
AM  ■  AiV,7txl  =  (7[0],7[1],---,7[L  -  1])T,  and  fipxi  = 
( 1 , 0,  ♦  •  * ,  0)T  and  the  element  at  the  (mAM  +  n)th  row  and 
the  k\h  column  in  the  matrix  HpXi  is 

h[k  +  mN]W[nMk ,  0  <  m  <  AN  -  1 

0<n<  AM-1,  0  <  k  <  L  -  1. 

In  the  critical  sampling  case  and  when  Hpxl  has  full  rank, 
there  is  a  unique  solution  for  the  analysis  window  function 
7[n].  In  the  oversampling  case  and  when  HpXi  has  full  rank, 
there  are  infinite  many  solutions  for  the  system  (2.5).  Among 
them,  the  minimum  norm  solution  was  given  in  [17] 

Ttxi  =  1  (2.7) 

where  f  means  the  complex  conjugate  transpose.  It  was  proved 
in  [18]— [20]  that  the  above  minimum  norm  solution  is  also  the 
most  orthogonal-like  solution,  i.e.,  (a  more  general  form  was 
given  in  [22]) 

||7Lxi  -  hLxi\\  =  .  min  ||7ixi  -  hLx i||. 


The  DGT  and  IDGT  can  be  also  represented  in  matrix  forms. 
Let 

C  =  {Co,OjCo,i,'--  iCm-i,n-i)t 

5  =  (s[0],s[1],--‘,s[L-  1])t. 

The  DGT  can  be  represented  by  the  MN  x  L  matrix  Gmnxl 
with  its  ( mN  +  n)th  row  and  A;th  column  element 

7m,n[fc]  ~  mAM]H/£’nAiVfc,  0  <  m  <  M  -  1 

0  <  n  <  iV  —  1,  0  <  k  <  L  -  l. 

The  IDGT  can  be  represented  by  the  L  x  MN  matrix  Hlxmn 
with  its  fcth  row  and  (mN  +  n)th  column  element 

hmtn[k ]  =  h\k  -  mAM)W2*Nk,  0  <  m  <  M  -  1 
0  <  n  <  N-  1,  0  <  A;  <  L  -  1. 

Thus  . 

C  =  Gmnxls  and  s  =  HlxMnC.  (2.9) 
The  condition  (2.5)  implies  that 

HlxmnGmnxL  =  Ilxl  (2.10) 

where  ILxl  is  the  L  x  L  identity  matrix. 

B.  Iterative  Time-Variant  Filtering  Algorithm 

We  next  want  to  briefly  review  the  iterative  time-variant 
filtering  algorithm  proposed  in  [5]-[7].  This  algorithm  is  used 
later  in  the  denoising  for  the  system  identification  problem. 

The  oversampling  of  the  DGT  adds  redundancy,  which 
is  usually  preferred  for  noise  reduction  applications.  From 
(2.1M2.5),  (2.9),  and  (2.10),  one  can  see  that  an  L- 
dimensional  signal  s  is  transformed  into  an  MJV-dimensional 
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Fig.  2.  Iterative  time-varying  filtering  algorithm. 


signal  C,  and  MN  is  greater  than  L  due  to  the  oversampling. 
Therefore,  only  a  small  set  of  MTV-dimensional  signals  in  the 
TF  plane  have  their  corresponding  time  waveforms  with  length 
L.  Let  Dmkxmn  denote  the  mask  transform,  specifically,  a 
diagonal  matrix  with  diagonal  elements  either  0  or  1.  Let  a 
be  a  signal  with  length  L  in  the  time  domain.  The  first  step 
in  the  time-variant  filtering  is  to  mask  the  TF  transform  of  s 

Cl  =  DmNxMnGmNxL* 

where  D\/.vx  mn  masks  a  desired  domain  in  the  TF  plane. 
Since  the  DGT  Gmnxl  is  a  redundant  transformation,  the 
IDGT  of  Ci,HLxmnCi  may  not  fall  in  the  mask.  In  other 
words,  in  general 

GmsxlHlxmsCi  /  Dmn X mnGmnxlHl x MnC  1 

(2.11) 

where  MN  >  L,  which  is  illustrated  in  Fig.  1(e).  Notice  that 
in  the  critical  sampling  case,  i.e.,  MN  =  L,  the  inequality 
(2.11)  becomes  an  equality.  An  intuitive  method  to  reduce  the 
difference  between  the  right-  and  the  left-hand  sides  of  (2.1 1) 
is  to  mask  the  right-hand  side  of  (2.11)  again  and  repeat  the 
procedure,  which  leads  to  the  iterative  algorithm 

50=5  (2.12) 

Ci+i  =  DmkxmnGmnxl8i  (2.13) 

Si+  r  =  HixmnCi+i,  l  —  0, 1,2,  •  ■  • .  (2.14) 

The  above  iterative  algorithm  is  illustrated  in  Fig.  2. 

Before  going  to  the  convergence,  let  us  see  what  the  LSE  is. 
Based  on  the  definition,  the  LSE  solution  is  the  L  x  1  vector 
x  that  minimizes 

IIGa/.VxL^  -  D Kf N x M nG M pf  x L&W 

=  imn  || Gmnxl*  -  DmnxMnGmnxlsW  (2.15) 

Then 

x  =  (GmnxlGmnxl)~1GI{NxLDMNxmnGmnxls- 

(2.16) 


Clearly,  when  the  signal  length  L  is  large,  the  inverse  matrix 
computation  is  expensive.  Although  the  error  in  (2.15)  is  min¬ 
imized,  the  DGT  of  the  least-squared  solution  x  may  not  fall 
in  the  mask  Dmnxmn •  GmnxL*  i1  DmnxMnGmnxL* 
when  MN>L. 

The  complexity  for  the  iterative  algorithm  (2. 12) — (2.14)  is, 
however,  low,  which  does  not  need  to  compute  inverses  of 
large  size  matrices.  By  considering  the  DGT  and  IDGT  in 
(2.1M2.4),  the  computational  complexity  in  (2.12M2.14)  is 
proportional  to  the  signal  length  multiplied  by  the  window 
length,  i.e.,  LLw-  Notice  that  the  complexity  of  directly 
computing  the  inverse  matrices  in  (2.16)  is  proportional  to  L3. 
Therefore,  when  the  length  of  window  functions  h  and  7  is 
much  shorter  than  the  length  of  the  signal  5,  the  computational 
complexity  in  the  iterative  algorithm  (2.12) — (2. 14)  is  much 
lower  than  the  one  for  the  least-squared  solution  in  (2.16). 

We  next  want  to  list  several  related  results  on  the  above 
iterative  algorithm  obtained  in  [6]  and  [7],  such  as  the  conver¬ 
gence,  the  properties  of  the  limit  signals,  and  the  relationship 
between  this  algorithm  and  the  LSE  solution.  These  results 
are  based  on  the  condition  on  the  window  functions  h  and  7 
obtained  in  [6]  and  [7]: 

AN-1 

Y,  7m[lN  +  k]h[lN  +  k  +  mAM] 

i= 0 

AN- 1 

=  Y  h*[lN  +  k]j[lN  +  h  +  mAM]  (2.17) 

1=0 

for  fc  =  0, 1, ■  * *,  JV  —  1  and  m  =  0, 1, * •  • , M  -  1. 

Theorem  1:  When  the  synthesis  and  the  analysis  window 
functions  h[n\  and  7[n]  satisfy  condition  (2.17),  the  iterative 
algorithm  (2.12H2.14)  converges. 

There  are  two  trivial  cases  where  (2.17)  holds.  The  first 
case  is  the  orthogonal  case  h\n }  —  7 [n]  for  all  integer  n.  The 
second  case  is  the  critical  sampling  case  AM  =  N.  Notice 
that  the  continuous  Gabor  transform  is  never  orthogonal  unless 
the  window  functions  are  badly  localized  in  the  frequency 
domain.  This,  however,  is  not  the  case  for  the  DGT.  The 
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original  chirp  time  waveform  x[n] 


Fig.  3.  Transmitted  signal  x[n]  and  its  Fourier  spectrum  A’(u>). 


most  orthogonal-like  solution  was  studied  by  Qian  et  al  in 
[18H20].  They  showed  that  it  is  possible  to  have  the  analysis 
window  function  7  very  close  to  the  synthesis  window  function 
h  when  h  is  truncated  Gaussian.  The  error  between  h  and  7  is 
less  than  2  x  10"6  (see  Fig.  4)  while  they  are  of  unit  energy, 
and  therefore,  the  error  is  negligible.  It  was  shown  in  [6]  that 
the  performance  of  the  iterative  algorithm  strongly  depends 
on  (2.17).  When  this  condition  does  not  satisfy,  the  iterative 
algorithm  may  not  converge. 

Theorem  2:  Under  (2.17),  the  DGT  of  the  limit  5  of  the 
iterative  algorithm  (2.12H2.14)  falls  in  the  mask  DmnxMN * 
i.e. 


Gmkxls  =  DmkxmnGmnxls-  (2.18) 


m.  Denoising  for  Received 
Signals  Through  a  Noisy  Channel 

In  this  section,  we  want  to  do  noise  reduction  with  the  time- 
variant  filter  studied  in  Section  II  for  received  signals  in  a 
noisy  channel. 


A.  Some  Parameters 

The  signal  length  is  randomly  chosen  as  500.  The  signal 
x[n]  for  the  transmitter  is 


x[n]  =  cos 


n+  15]4N\ 

150  J  J  ’ 


n  =  0,1,  ■••,499.  (3.1) 


The  above  results  say  that  as  long  as  (2.17)  on  the  analysis 
and  synthesis  window  functions  is  satisfied,  the  iterative 
algorithm  converges,  and  the  limit  signal  has  the  desired  TF 
characteristics,  i.e.,  its  DGT  falls  in  the  desired  mask.  One 
might  ask  whether  it  violates  the  known  fact  that  an  image 
of  a  TF  transform  of  a  signal  in  the  TF  plane  cannot  be 
compactly  supported.  This  is  because  a  signal  cannot  be  time- 
and  bandlimited  simultaneously.  To  answer  this  question,  we 
first  need  to  know  that  the  above  known  fact  is  true  for 
continuous  TF  transforms.  Moreover,  the  proof  of  the  fact  is 
based  on  the  marginal  properties  of  TF  transforms.  It  may  not 
be  true  for  discrete  TF  transforms.  In  other  words,  discrete  TF 
transforms  may  have  compact  support  [5]. 

Theorem  3:  Under  (2.17),  the  first  iteration  s\  of  the  it¬ 
erative  algorithm  (2. 12) — (2. 14)  is  equal  to  the  least-squared 
solution  in  (2.16),  i.e.,  s\  =  x. 

With  this  result,  one  will  see  later  that  the  iterative  algorithm 
(2.12X2.14)  improves  the  least-squared  solution  when  the 
number  of  iterations  increases,  and  meanwhile,  one  does  not 
need  to  compute  the  inverse  matrix  in  (2.16). 


The  waveform  and  its  Fourier  transform  X(u;)  of  the  above 
signal  x[n]  are  shown  in  Fig.  3.  Notice  that  since  the  Fourier 
power  spectrum  |X(u;)|2  will  be  used  in  the  denominator  in 
the  system  identification,  it  should  be  as  far  away  from  zero  as 
possible.  Since  the  noise-reduction  performance  of  the  time- 
variant  filtering  in  Section  II  depends  on  the  localization  of 
the  signal  in  the  TF  plane,  the  transmitted  signal  x [n]  should 
be  as  concentrated  in  the  joint  time  and  frequency  domain  as 
possible.  The  synthesis  and  analysis  window  functions  used  in  # 
this  paper  are  shown  in  Fig.  4,  where  their  lengthes  are  256. 
The  synthesis  window  function  is  just  the  Gaussian  function 
and  its  analysis  window  function  is  the  most  orthogonal-like 
solution  given  in  (2.7).  Their  difference  and  the  difference 
between  the  left-hand  side  and  the  right-hand  side  of  (2.17), 
i.e.,  the  condition  error,  are  also  shown  in  Fig.  4.  One  can 
see  that  they  almost  satisfy  (2.17).  The  time  sampling  interval 
length  AM  =  16  and  the  frequency  sampling  interval  length 
AN  =  2  in  the  discrete  Gabor  transform  and  its  inverse  in 
Section  n.  These  parameters  are  used  throughout  the  rest  of 
this  paper.  The  DGT  of  x[n]  is  shown  in  Fig.  5.  The  tail  part 
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synthesis  window  h[n]  analysis  window  ifn] 


condition  error 


k  0  0 


Fig.  4.  Synthesis  and  analysis  window  functions  and  the  condition  (2.17)  test. 
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Fig.  5.  Discrete  Gabor  transform  of  signal  «r[n]. 

of  the  DGT  in  Fig.  5  is  because  of  the  discrete  calculation 
aliasing. 

In  this  paper,  we  use  20-tap  LTI  systems  in  our  numerical 
examples,  where  the  number  20  is  just  randomly  chosen.  The 
channel  model  is 

N- 1 

y[n]  =  ^  h[k]x[n  —  k]  +  t>[n]  (3.2) 

Ar=rO 

where  N  =  20  in  the  following  numerical  examples,  t/[n]  is  an 
additive  white  Gaussian  noise  and  independent  of  the  signal 


x[n]f  and 

N- 1 

s[n]  =  Y,  "  * ]  (3-3> 

k=0 

is  considered  to  be  the  signal,  x[n]  is  the  transmitted  signal 
as  in  (3.1),  2/[n]  is  the  received  signal,  and  h[n]  is  an  LTI 
system  (or  channel)  impulse  response.  The  original  SNR  for 
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LTI  channel  h[n] 
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LTI  channel  Fourier  spectrum 


Fig.  6.  Example  of  LTI  channel  A[n].  signal  s[n],  and  received  signal  yfni 
white  Gaussian  noise.  *  &  yL  ' 


and  their  Fourier  spectrum,  where  the  SNR  = 


—4.5  dB  for  the  additive 


DGT  of  received  signal  y[n] 


the  received  signal  is  calculated  by 


10  log 


10 


/  499  \ 

/Ei*Hi2' 

n=Q _ 

499 

vEwi2, 

'  n=0  / 


In  the  following,  we  randomly  generate  the  channel  ft[n].  As 
an  example,  a  channel  Fourier  spectrum  and  received  signal 


time  waveform  y[n]  with  SNR  =  -4.5  dB  and  the  signal  s[n] 
without  noise  and  their  Fourier  spectrum  are  shown  in  Fig.  6. 
The  DGT  of  the  received  signal  r/[n]  with  -4.5  dB  SNR  is 
shown  in  Fig.  7.  In  Fig.  7,  one  is  still  able  to  see  the  chirp 
pattern  in  the  joint  time  and  frequency  plane,  although  it  is 
impossible  in  the  time  or  the  frequency  domain  alone  in  Fig.  6. 

B.  Mask  Design 

The  pattern  in  the  DGT  domain  of  the  above  signal  s[n]  in 
(3.3)  is  similar  to  the  one  for  the  signal  x{n]  in  Fig.  5.  This  is 
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not  only  true  for  this  particular  example  but  is  also  the  case  for 
our  numerous  examples.  The  reason  is  due  to  the  following 
analytic  argument. 

Assume  the  chirp  signal  x[n]  =  exp(jcn  )  for  some 
constants  r  >  2  and  c  ^  0.  Then 

s[n]  =  22  MfcMn  ~ 
k 

=  22  h[k]  exp (jc(n  -  k)T) 

=  exp  (jcnr )  22  exP  ( ic  53  ClU  **  ) 

k  \  i-o  / 

=  x[n]  22  exp  ( ic'52Cin‘kT-‘  ) 

k  V  1=0  > 

which  is  dominated  by  the  original  chirp  x[n]  for  finite  tap 
LTI  systems  h[k}.  It  is  because  that  the  highest  chirp  order  of 
s[nl,r,  and  the  corresponding  chirp  rate  are  the  same  as  those 
of  x[n],  whereas  the  chirp  order  for  the  above  multiplier  of 

x[n]  in  s[n] 

22  kik\ exp  ( ic  53  c‘n!kr  1 ) 

k  \  i=o  / 

is  only  r  -  1.  As  a  special  case,  when  r  =  2 
s[n]  =  x[n]G(2cn) 

where  G(w)  is  the  Fourier  transform  of  the  signal  /i[n]x[n] 

G(u)  =  22  exp(— j2cnA:). 

k 

When  the  channel  h{n)  has  only  a  finite  tap,  the  function  G(u>) 
is  usually  a  smooth  signal. 

Since  the  transmitted  signal  x[n]  is  known  to  both  trans¬ 
miner  and  the  receiver,  by  the  above  property  its  patternm 
the  DGT  domain  may  help  in  designing  a  mask  in  the  DO 
domain  for  filtering  noise.  This  is  exactly  the  motivation  for 
the  following  design  method  of  a  mask  DmnxMN  hi  the 
iterative  time-variant  algorithm  (2.12H2.14).  The  subscript 
MN  x  MN  of  the  mask  DmnxMN  will  be  dropped  from 
now  on  without  causing  confusion  in  understanding. 

1 )  Mask  Design  Procedure: 

Step  1)  Implement  the  DGT  C«,„  of  the  transmitted  signal 

x[A;]. 

Step  2)  Threshold  the  DGT  coefficients  Cm,n  and  have  a 
mask  Dx  from  Cm,n 


Dx{m 


if  |C(m,n)|  > 
otherwise 


to 


where  to  is  a  predesigned  positive  number  that  is 
called  thresholding  constant. 

Step  3)  Implement  Steps  1  and  2  for  the  received  signal 
«[*],  and  design  a  mask  Dy  with  thresholding 
constant  ti  from  the  DGT  coefficients  of  y[n]  with 
another  predesigned  constant  ti  >  0.  ^ 

Step  4)  The  final  mask  is  the  product  of  Dx  and  Dy:  D  = 
DxDy 

Since  the  DGT  of  the  signal  x[n)  usually  dominates  the 
DGT  of  the  signal  s[n],  the  pattern  in  the  DGT  domain  of 


the  signal  s[n]  is  usually  in  a  close  neighborhood  of  the 
pattern  in  the  DGT  domain  of  x[n].  Therefore,  the  mask  Dx 
is  usually  designed  so  that  it  covers  a  relatively  large  area, 
i  e ,  the  thresholding  constant  t0  in  Step  2  is  usually  chosen 
not  too  large.  Since  the  received  signal  y[n]  is  from  a  noisy 
channel,  the  resolution  of  its  DGT  pattern  may  be  reduced, 
and  therefore,  the  thresholding  constant  h  in  Step  3  is  usually 
chosen  to  be  not  too  small.  Otherwise,  the  mask  Dy  will  cover 
too  much  unwanted  area.  Let  us  see  an  example.  The  mask  Dx 
from  x[n],  the  mask  Dy  from  y[n],  their  product  D  -  DxUy, 
and  the  mask  Ds  from  the  true  signal  s[n]  are  shown  in 
Fig.  8,  respectively.  The  SNR  in  this  case  is  SNR  ^  • 

dB  The  thresholding  constants  in  Steps  1-3  are  t0  - 
and  h  =  0.15  •  max(DGT (y)).  It  should  be  pointed  out  that 
the  above  mask  design  procedure  may  be  improved  by  using 
more  sophisticated  designs.  Possible  improvements  are 

i)  to  find  the  optimal  thresholding  constants  to  anc*  ti  by 
training  a  large  number  of  signals  and  systems; 

ii)  to  use  more  sophisticated  statistical  detection  method 
in  the  DGT  domain  for  the  received  signal  i/[n]  instead 
of  a  simple  thresholding  in  Step  3; 

in)  to  smooth  the  mask  D  =  DxDy  since  the  true  mask  Ds 
is  usually  smooth  due  to  the  nature  of  a  chirp  signal, 
but  Dy  from  the  noisy  signal  y[n ]  may  not  be  smooth. 
Some  morphological  operations,  such  as  dilation,  may 
be  used  to  smooth  the  mask  D. 

Another  observation  from  our  various  numerical  examples 
is  that  the  mask  Dx  is  the  mean  of  the  true  mask  Ds  in  terms 
of  different  LTI  systems  h[n}. 

C.  Denoising  Experiments 

In  this  subsection,  we  want  to  implement  the  time-variant 
filtering  algorithm  in  Section  II  with  three  masking  techniques: 
using  the  mask  D  =  Dx  from  the  transmitted  signal,  using 
the  mask  D  =  DyDx  as  designed  by  Steps  using  the 
true  mask  D  =  D,.  We  run  100  tests  in  terms  of  different 
LTI  systems  h[n]  (randomly  generated)  and  different  additive 
white  Gaussian  noises  v[n}  for  each  masking  meth  an  e 
their  mean  SNR.  Nine  iterative  steps  are  used  m  the  iterative 
algorithm  (2.12M2.14).  Fig.  9  shows  the  curves  of  the  mean 
SNR  versus  iterative  steps  for  the  three  masking  meth  . 

First,  we  analyze  the  time-variant  filter  (2-12)  (2.1  )  wi 
the  mask  D  =  Dx.  From  Fig.  9,  the  SNR  drops  after  the 
second  iteration.  This  is  because  the  mask  we  used  is  D  - 
Dx,  which  matches  the  transmitted  signal  x[n]  mid  not  slnj. 
Although  there  is  a  similarity  (see  Fig.  8)  in  the  TF  plane 
between  the  DGT  of  x[n]  and  the  DGT  of  s[n],  they^are 
not  equal.  The  similarity  is  exactly  the  reason  why  the  SNR 
increases  significantly  in  the  first  and  the  second  iteration  step. 
The  difference  between  x[n]  and  s[n]  causes  the  SNR  to  op 
after  the  second  iteration.  Notice  that  the  mask  D  -  Vx  is 
known  to  the  receiver,  and  it  is  a  good  candidate  m  the  time- 
variant  filtering  if  the  iterative  algorithm  stops  at  the  second 

iteration  step.  ,  n  -  n  n 

We  now  analyze  the  performance  of  the  mask  U  —  vx  y- 
This  mask  rejects  a  lesser  portion  of  the  noise  outside  Ds  than 
Dx  alone  does,  when  the  first  thresholding  constant  f0  for  Ux 
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Fig.  8.  Example  of  masks  Dx  from  j[n],  Dy  from  y[n],  the  final  mask  D  =  DxDy.  and  the  true  mask  Ds  from  s[n],  where  the  SNR  =  -1.4  dB. 

mean  SNR 


Fig.  9.  Mean  SNR  curves  of  the  iterative  time-variant  filtering  with  the  following  masks:  D  =  Dx.  D  =  DxDy,  and  D  =  Ds. 


in  Step  2  is  less  than  the  one  in  designing  Dx  alone.  The  reason 
why  this  t0  for  D  should  not  be  large  is  for  the  conservation 
because  the  mask  Dx  is  multiplied  by  Dy  in  designing  D. 
It,  however,  happens  because  the  beginning  SNR’s  are  not  as 
high  as  the  ones  in  the  time-variant  filtering  with  the  mask  Dxy 
which  is  shown  by  the  solid  line  in  Fig.  9.  Since,  in  general, 
D  =  DxDy  covers  relatively  more  signal  information  than 
Dz  alone  does,  the  SNR  increases  when  the  iteration  number 
increases. 

The  third  masking  D  =  Ds  method  is  the  ideal  case.  With 
this  ideal  mask,  about  an  1 1  dB  SNR  increase  with  the  iter¬ 


ative  time-variant  filtering  over  the  original  SNR  is  achieved  * 
consistently.  Notice  that  by  Theorem  3,  the  first  iteration  is 
equal  to  the  conventional  least  squared  solution.  The  iterative 
time-variant  filtering  outperforms  the  least  squared  solution 
by  about  3  dB. 

To  improve  the  performance  of  the  iterative  time-variant 
filtering,  what  one  can  do  further  is  to  use  more  sophisticated 
methods  to  detect  Dx  and  Dyy  in  particular  Dy,  so  that 
their  product  D  =  DxDy  is  as  close  to  Da  as  possible. 
Besides  what  has  been  mentioned  in  the  previous  subsections, 
directly  minimizing,  the  difference  between  D  =  DxDy  and 
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DGT  of  y[n]  mask  D=DxDy 


20  40  60  20  40  60 


Fig.  10.  DGT  of  t/[n]  with  noise  and  s[n]  without  noise  and  their  corresponding  masks  (original  SNR  =2.7  dB). 


new  system  identification  method 


Fig.  1 1 .  New  system  identification  method. 


Ds  with  training  signals  is  another  potential  approach.  When 
the  original  SNR  is  not  too  low,  the  chirp  pattern  of  s[n]  can 
usually  be  seen  clearly  in  the  DGT  domain  of  the  received 
signal  y[n\ .  An  example  is  shown  in  Fig.  10,  where  the  original 
SNR  =  2.7  dB. 


The  conventional  system  identification  method  used  here  is 
the  cross-spectral  method 

Hnew{uj)  =  sSm  "  (41) 


IV.  System  Identification 

In  this  section,  we  first  use  the  iterative  time-variant  filter 
(2.12M2.14)  developed  in  the  previous  sections  to  reduce 
the  additive  white  Gaussian  noise  v[n]  from  the  received 
signal  y[n).  In  the  iterative  time- variant  filter,  for  calculation 
simplicity,  we  choose  the  first  masking  method  studied  in 
Section  III-C,  i.e.,  the  mask  D  =  Dx,  for  all  calculations 
in  this  section.  With  this  mask,  two  iterations  are  used  in 
the  time- variant  filter  in  Section  EI-B.  We  then  implement 
the  conventional  system  identification  method,  as  shown  in 
Fig.  11. 


where  x[n]  is  the  chirp  signal  defined  in  (3.1).  It  is  compared 
with  the  conventional  method  without  denoising,  i.e.. 


tfoldi  (w) 


Syx(v) 

SXx{v) 


(4.2) 


where  x[n]  is  also  the  chirp  signal.  Since  the  system  iden¬ 
tification  performance  usually  depends  on  the  signal  x[n] 
transmitted,  one  might  say  that  it  is  not  fair  to  compare  them 
using  the  chirp  signal  that  is  preferred  here  for  denoising  but 
might  not  be  preferred  for  other  methods.  For  this  reason,  we 
also  compare  our  new  method  with  the  conventional  method 
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Fig.  12.  Comparison  of  system  identification  methods.  The  conventional  method  using  chirp  signals;  the  conventional  method  using  pseudo-random  signals; 
new  method  using  chirp  signals,  and  time-variant  filtering. 


original  00  noise 


old  method  new  method 


Fig.  13.  System  identification  examples:  Original  spectrum  |Jf(w)|:  identified  spectrum  without  additive  noise  using  the  chirp  signal;  conventional  method 
with  additive  noise  of  SNR  =  -0.4  dB;  new  method  with  additive  noise  of  SNR  =  -0.4  dB. 


using  pseudo-random  sequences 


■Holds  (w) 


Syxjul) 

Sii(w) 


(4.3) 


where  x  [n]  is  a  pseudo-random  sequence. 

Fig.  12  shows  their  performances,  where  200  tests  are  used 
for  the  mean  SNR  curves  for  the  system  spectrum  versus  the 
original  SNR.  Our  new  method  performs  much  better  than 
others.  Surprisingly,  even  for  the  conventional  cross  spectral 


method,  the  chirp  signal  in  (3.1)  outperforms  pseudo-random 
signals  by  about  6  dB.  In  Fig.  13,  some  identification  examples 
are  shown,  where  the  original  SNR  is  —0.4  dB.  As  a  remark, 
all  system  identification  calculations  used  in  this  paper  are 
based  on  the  Matlab  Signal  Processing  Toolbox. 

V.  Conclusion 

In  this  paper,  we  proposed  a  system  identification  method. 
The  proposed  method  is  based  on  transmitting  chirp  signals 
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crosscorrelation  between  x[n]  and  original  noise 


crosscorrelation  between  x{n]  and  new  noise 


Fig.  14.  Cross  correlations  between  the  new  noise  r[n]  (SNR  =  0.74  dB)  and  the  signal  j-[n]  and  the  original  noise  t>[n]  and  the  signal  x[n]  (SNR  =  -6.4  dB). 


and  denoising  followed  by  the  conventional  identification 
method.  The  denoising  method  is  based  on  time-variant  filter¬ 
ing  in  the  joint  time-frequency  (TF)  domain.  Since  transmitted 
signals  are  chirp-type  signals,  they  are  well-localized  in  the 
TF  domain,  and  one  is  usually  able  to  see  their  patterns  in 
the  TF  domain,  even  in  a  very  low  SNR  environment-  Due  to 
.this  property,  a  significant  SNR  increase  after  a  time- variant 
filtering  can  be  achieved.  Our  numerical  simulations  were 
performed  to  illustrate  this  theory.  The  simulations  done  in  this 
paper  were  used  simply  for  showing  the  potential  performance 
of  the  new  approach  based  on  time-frequency  analysis  and 
synthesis  techniques  in  very  low  SNR  environment.  Several 
further  improvements  are  possible.  They  are 

i)  to  use  more  sophisticated  detection  methods  in  design¬ 
ing  masks  D  for  the  iterative  time-variant  filter, 

ii)  to  search  the  optimal  reference  signal  x[n]  so  that  its 
Fourier  spectrum  is  as  far  away  from  0  as  possible  and 
it  localizes  in  the  TF  domain  as  much  as  possible; 

iii)  to  use  more  sophisticated  existing  system  identification 
methods,  such  as  the  method  recently  proposed  in  [1] 
by  Shalvi  and  Weinstein,  where  the  additive  noise  v[n] 
in  the  system  model  is  not  necessarily  independent  of 
the  signal  x[n]. 

The  reason  for  mentioning  iii)  here  is  because  of  the  fol¬ 
lowing  argument.  Since  a  joint  TF  domain  filter  that  usually 
depends  on  the  signal  x[n]  is  used,  the  new  noise  v[n]  after 
denoising  and  the  transmitted  signal  x[n]  may  have  similar 
TF  characteristics,  and  therefore,  they  may  be  correlated,  in 
particular,  when  the  original  SNR  is  too  low.  Such  an  example 
is  shown  in  Fig.  14,  where  the  original  SNR  =  —6.4  dB 
and  the  SNR  =  0.74  dB  after  the  second  iteration  of  the 
time- variant  filtering.  From  Fig.  14,  one  can  clearly  see  that 
the  correlation  between  the  new  noise  v[n]  after  denoising 
and  the  signal  x[n]  exists,  whereas  it  does  not  exist  between 


the  original  noise  t/[7i]  and  x[n].  It  should  be  observed  from 
our  numerous  numerical  examples  that  this  phenomenon  only 
happens  when  the  original  SNR  is  very  low. 
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A  Family  of  Pulse-Shaping  Filters  with  ISI-Free 
Matched  and  Unmatched  Filter  Properties 

Xiang-Gen  Xia 


Abstract — The  raised-cosine  pulse-shaping  filter  plays  an  Im¬ 
portant  role  in  digital  communications  due  to  intersymbol 
interference  (lSI)-free  property. The  ISI-free  property  holfcafter 
matched  filtering  is  perform**.  In  this  P W 

new  family  of  pulse-shaping  filters.  These  filters  are  ISI  Tree 
Jith  or  without  matched  filtering.  Using  these  new  pulse-shaping 
filters,  the  computational  load,  and  therefore  ^hardware  cost 
in  demodulation  for  modem  design,  might  be  reduced  in  some 
applications. 

Index  Terms —  ISI-free  property,  matched  and  unmatched  fil- 
tering,  pulse-shaping  filters. 

I.  Introduction 


l  HE  raised-cosine  filter 
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plays  an  important  role  in  digital  communication  systems.  It 
has  been  used  extensively  in  modem  design  for  both  wireline 
and  radio  systems.  This  is  mainly  due  to  its  intersymbol 
interference  (ISI)-free  property,  i.e., 

f  1,  n  =  0 

/i(nT,)  =  6(n)  =  n  =  ±l.±2.-- 

where  H(u>)  and  h(t)  are  the  frequency  and  the  time  response 
functions,  respectively.  There  have  been  extensive  discussions 
of  this  topic;  see,  for  example,  [1H4]- 

Since  the  ISI-free  property  holds  after  the  matched  filtering 
is  performed  for  the  received  signal,  the  frequency  response 
G(oj)  of  the  transmitted  waveform  g(t)  should  be  the  square 
root  of  H{u>)  in  (1),  i.e., 

G{uj)  =  s/Wuj)  and  g[t)  =  f~l(G(w))  (2) 

where  T  stands  for  the  Fourier  transform  and  T~x  means 
its  inverse.  The  matched  filtering  plays  two  roles  here.  One 
is  low-pass  filtering  that  reduces  the  noise,  and  the  other 
is  ISI  reduction  due  to  the  ISI-free  property  of  the  raised- 
cosine  filters.  Since  the  length  of  these  filters  is  not  short,  the 
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hardware  implementation  cost  in  current  modem  systems  is 
significant.  However,  it  may  occur  in  practice  that,  for  some 
users,  the  matched  filtering  is  used  purely  for  reducing  the 
ISI.  In  this  case,  if  the  transmitted  signal  is  already  ISI  free, 
the  matched  filtering  may  not  be  necessary.  The  question  then 
becomes  whether  it  is  possible  to  construct  pulse  shaping  filters 
G(u>)  at  the  transmitter  so  that  both  the  transmitted  signal  and 
the  signal  after  matched  filtering  are  ISI  free,  i.e., 
g(nT,)  =  S(n)  and  h(nT ,)  =  8(n) 

where  h(t)  is  the  time-domain  waveform  of  H(u)  =  |<j(w)|2. 

In  this  letter,  we  will  positively  answer  this  question  by 
proposing  a  family  of  such  pulse-shaping  filters. 

II.  A  New  Family  of  Pulse-Shaping  Filters 

In  this  section,  we  present  a  new  family  of  real-valued  pulse¬ 
shaping  filters  which  have  ISI-free  properties  with  or  without 
matched  filtering. 

Let  g(t)  denote  the  waveform  in  the  time  domain  to  be 
transmitted,  and  let  G(*i)  denote  its  Fourier  transform.  Let 
/,(/)  be  the  waveform  in  the  time  domain  after  the  matched 
filtering  of  g(t)  is  performed,  and  let  H(uj)  denote  its  Fourier 
transform.  Then,  H( u>)  =  |G(u»)|2.  Without  loss  of  generality, 
from  now  on,  we  assume  T,  =  1.  The  ISI-free  property  for 
the  waveform  g(t)  is 

g(n)  =  6{n),  ne  Z 

where  Z  is  the  set  of  all  integers.  This  is  equivalent  to 

G(u>  +  2ntr)  =  1.  (3) 

n 

The  ISI-free  property  for  the  waveform  h(t)  is 
h(n)  =  6(n),  n  €  Z 

which  is  equivalent  to 

|G(u>  +  2n*)|2  =  1.  (4) 

n 

We  want  to  construct  real-valued  g(t)  that  satisfies  (3)  and  (4). 
Let  v{x )  be  a  continuous  function  such  that 

x  <  0 
x  >  1 

V  — 

and 


■<*>-{?: 


(5) 


i/(x)  +  i/(l  -  *)  =  1,  x€R 


(6) 


where  R  is  the  set  of  all  real  numbers.  An  example  of  such 
a  t/(x)  is 

(0,  x<0 

t/(x)  =  {  x4(35  -  84x  +  70x2  -  20x3),  0  <  x  <  1 

1 1,  x 

(7) 
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which  has  almost  fourth-order  smoothness.  The  simplest  form 
for  such  v  is  T  <  o 

t/(j)  =  |x.  0<x<l 

ll.  X  >  1 

which  is  only  continuous,  but  not  differentiable. 

We  determine  g(t)  by  constructing  its  Founer  transform 

G(  u>): 


G(u>)  = 
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(8) 

Notice  that  the  parameter  function  v  controls  the  width  of 
the  transfer  band  of  the  filter  G(u>).  The  smoothness  of  the 
function  „  determines  the  speed  of  the  waveform  decay  of . 9(f) 
in  the  time  domain,  i.e.,  the  length  of  the  filter.  The  smoother 
u  is  implies  the  shorter  the  filter  g(t)  will  be-  R 

Theorem  l:  The  pulse-shaping  filters  g{t)  defined  by  (8) 
satisfv  the  following  properties. 

1)  They  are  real  valued. 

2)  They  are  ISI  free  by  themselves,  i.e..  g(n)  -  6(n). 

3)  They  are  ISI  free  after  matched  filtering  is  performed. 

i.e.,  h(n)  =  £(»)•  _ 

Proof:  To  prove  1),  we  only  need  to  prove  G*(-w)  = 
G(w)  for  2x/3<M<4ir/3 


?s 


Fig.  I. 
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The  frequency  responses  \HW)\  and  |G(--)I  for  the  new  pulse 
and  the  raised  cosine  filters  with  o  _  1/3. 
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=  i(l+e^(3/2*V-i)) 


e-ijri/((3/2ir)(-w+2»)-l)^ 
c-»M2-(3/2ir)w)^ 
e-i*(l-i/(-l  +  (3/2*)w))^ 


=  G(u>) 

where  step  1  is  from  (6).  c  r<t  ,\ 

To  prove  2),  we  only  need  to  prove  (3).  The  form  of  G(  ) 

in  (8)  satisfies  (3)  for  2*/3  <  u»  <  4rr/3 

y;  G(u>  +  2 fix)  =  G(oj)  +  G(u  -  2k)  =  1. 

n 

This  proves  2).  n 

The  property  3)  can  be  similarly  proved.  u 

The  frequency  responses  H(u>)  and  G( u>)  for  the  above  new 
pulse-shaping  filters  in  (8)  with  the  v  function  in  (7),  and  the 


raised-cosine  filter  with  a  =  1/3  in  (1)  and  its  square  root  are 
illustrated  in  Fig.  1. 

HI.  Conclusion 

In  this  letter,  we  proposed  a  new  family  of  pulse-shaping 
filters.  These  pulse-shaping  filters  are  ISI  free  with  or  without 
matched  filtering  at  the  receiver.  This  property  may  reduce 
the  hardware  cost  in  designing  modem  systems  in  some  appli¬ 
cations  where  the  low-pass  (bandpass)  filtering  is  performed 
before  the  matched  filtering.  It  should  be  noticed  that,  although 
the  new  pulse-shaping  filters  are  real  valued,  they  are  not  linear 

phase. 
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Abstract  In  this  paper,  we  consider  the  problem  of  extrapolation  of  a  band-limited  signal 
outside  a  fixed  interval  from  its  (approximate  or  contaminated)  values  in  that  interval.  We 
propose  a  new  extrapolation  method  that  estimates  the  error  between  the  extrapolated  and 
true  values,  and  which  also  resolves  the  ill-posedness  of  the  problem.  The  method  is  called 
a  modified  minimum  norm  solution  (MMNS)  method.  Both  the  continuous  MMNS  and  its 
discretization  are  studied.  The  error  estimates  hold  for  some  classes  of  band-limited  signals, 
when  the  maximum  magnitude  of  the  data  error  is  known.  These  classes  of  band-limited  signals 
are  also  characterized. 


1.  Introduction 

Let  /  be  a  finite  energy  signal,  i.e.  /  e  L2(R).  Its  Fourier  transform  /  is  defined  by 

/OO 

dr.  (1.1) 

-OO 

If  there  exists  a  positive  number  Q  such  that  f(co)  =  0  when  \co\  >  S2,  /  is  called  £2  band 
limited.  An  Q  band-limited  signal  /  can  be  represented  by  its  inverse  Fourier  transform: 


It  is  known  (see  for  example  [1])  that  a  band-limited  signal  /  is  the  restriction  to  the  real 
line  R  of  an  entire  function  defined  on  the  complex  plane  C.  Therefore,  in  theory,  /  is 
determined  everywhere  by  its  values  on  an  interval  no  matter  how  small  this  interval  is. 
This  motivates  the  following  band-limited  signal  extrapolation  problem. 

How  does  one  practically  extrapolate  an  Q  band-limited  signal  f  outside  an  interval 
[-7,  7]  when  f(t)  is  given  for  t  e  [-7,  7]  with  a  certain  contamination  error ? 

The  above  extrapolation  problem  is  interesting  not  only  in  theory  but  also  in  many 
applications,  such  as  spectral  estimation  (Papoulis  [25])  and  limited-angle  tomography 
in  medical  image  reconstruction  (Natterer  [24]),  where  only  limited  observation  data  are 
available. 

Since  /  is  analytic,  a  trivial  solution  for  the  problem  is  to  compute  the  derivatives  /(n) 
at  t  =  0  by  using  the  values  of  /  in  [-7,  7]  and  then  use  the  Taylor  expansion.  However, 
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this  method  is  extremely  unstable  due  to  the  instability  of  the  derivative  computations. 
Numerical  differentiation  is  an  ill-posed  problem  and  the  degree  of  ill-posedness  (which  can 
be  made  precise  using  Sobolev  negative  norms)  increases  with  the  order  of  differentiation. 
Therefore,  researchers  have  been  seeking  other  methods.  Since  the  early  1970s  there  has 
been  considerable  interest  in  this  area,  for  example  [4-8, 1 1-17, 24-30, 32-36, 38-40].  Since 
the  problem  itself  is  basically  an  inverse  problem,  it  has  been  recognized  that  the  existing 
extrapolation  methods  are  generally  unstable  in  terms  of  inaccurate  data.  The  extrapolated 
values  can  change  dramatically  when  the  given  data  in  an  interval  change  slightly,  see 
for  example  [27].  There  are  also  many  modified  algorithms  that  have  been  proposed  to 
improve  the  extrapolation  performance.  However,  to  the  best  of  our  knowledge  there  is  no 
extrapolation  algorithm  with  which  one  is  able  to  estimate  the  error  between  the  extrapolated 
and  true  values  outside  the  given  interval  [— 7\  T]  for  any  nontrivial  class  of  Q  band-limited 
signals,  when  the  given  data  are  inaccurate. 

In  this  paper,  we  propose  a  new  extrapolation  method  for  band-limited  signals  that  we 
call  a  modified  minimum  norm  solution  (MMNS)  method.  With  the  MMNS  method  we 
are  able  to  estimate  the  error  between  the  extrapolated  and  true  values  for  some  nontrivial 
classes  of  band-limited  signals,  when  the  maximum  magnitude  of  the  error  of  the  given 
inaccurate  data  in  a  certain  interval  is  known.  This  paper  is  organized  as  follows.  In 
section  2  we  study  the  MMNS  method  for  continuous-time  signals.  In  section  3  we  study 
the  MMNS  method  for  discrete-time  signals,  which  is  a  discretization  of  the  method  in 
section  2.  In  section  4  we  present  tractable  characterizations  of  the  classes  of  band-limited 
signals  studied  in  sections  2  and  3.  In  section  5  we  make  several  remarks. 


2.  Band-limited  signal  extrapolation  in  the  continuous-time  domain 

In  this  section,  we  study  the  MMNS  method  for  continuous-time  band-limited  signals. 
Without  loss  of  generality,  in  what  follows  we  assume  Q  =  2n  and  T  =  1  although  we 
continue  to  use  Q  and  T  to  emphasize  where  they  appear.  We  also  assume  /«  =  /  +  >? 
where  rj  is  the  error  signal  that  is  continuous  in  time  and  \rj(t)\  ^  e  for  t  e  [—7\  7*], 
and  f€(t)  for  t  e  [-7\  T]  are  the  given  data.  By  normalization,  we  may  assume  that  the 
maximal  error  magnitude  eel. 

We  first  introduce  some  notation.  Let  L2[-£>,  D ]  denote  the  space  of  all  signals  /  that 
satisfy 

aD  \  1/2 

D\f(t)\2dtj  <  oo 
where  D  is  a  positive  number  or  oo. 

Let  BC  denote  all  £2  band-limited  signals.  For  y  ^  0,  let  BCy  denote  all  £2  band-limited 
signals  /  e  BC  that  satisfy  the  following  condition. 

For  any  <5  >  0,  there  exists  a  signal  g&  6  L2[— T,  T ]  such  that 

satisfies  the  following  two  properties: 

\\f-fs\\m^S  (2.2) 


and 


II /ill  (oo  <  c&~y 


(2.3) 
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where  C  is  a  constant  that  is  independent  of  8  and  y,  and  /  is  the  Fourier  transform  of  /. 

The  physical  meaning  of  the  above  subspace  of  all  £2  band-limited  signals  is  as  follows. 
For  an  £2  band-limited  signal  /,  its  Fourier  transform  /  is  supported  in  [— £2,  £2]  and 
/  e  L2[— £2,  £2].  The  correspondence  between  the  space  BC  of  all  £2  band-limited  signals 
and  the  space  L2[-£2,  £2]  of  all  finite  L2  norm  signals  defined  on  [-£2,  £2]  is  one-to-one 
and  onto.  Therefore,  for  a  general  £2  band-limited  signal  /  its  Fourier  transform  /  may 
not  have  any  smoothness  property.  The  subspace  BCy  contains  all  £2  band-limited  signals 
/  with  the  following  properties^ 

(i)  The  Fourier  transform  /  can  be  approximated  in  the  L2  sense  by  a  family  {/$}  of 
7  band-limited  signals  (entire  functions  of  exponential  order).  This  approximation  holds 
inside  the  frequency  band  of  /,  i.e.  the  support  [-£2,  £2]  of  /. 

(ii)  The  L2  norms  on  the  whole  real  line  of  the  signals  in  the  family  {/$}  may  not  be 
uniformly  bounded,  but  the  rate  of  the  divergence  is  not  arbitrary.  Rather  the  rate  is  related 
to  the  rate  of  the  convergence  of  {/$}  in  L2[— £2,  £2]  to  /  as  8  ->  0. 

In  this  approximations  framework,  what  is  gained  is  the  smoothness  while  what  is  lost 
is  the  boundedness  of  the  family  of  L 2  norms  on  the  real  line.  This  trade-off  is  similar 
to  the  bandwidth  and  the  timewidth  trade-off  [29,30].  More  precise  interpretation  and 
characterization  of  the  above  subspace  will  be  given  in  section  4. 

For  the  maximal  error  magnitude  €  mentioned  at  the  beginning  of  this  section  and  any 
number  k  ^  0,  let  BT€tk  denote  the  set  of  all  signals  g  e  L2[— 7,  7]  such  that 

^  A  fort  €[-7,7].  (2.4) 

The  basic  idea  for  this  subspace  is  to  find  signals  in  a  neighbourhood  of  the  inaccurate  data 
signal  f€(t)  for  t  €  [-7,  7]  such  that  the  Fourier  transforms  of  these  signals  are  7  band 
limited. 

For  k  >  let  g€>x  be  the  unique  element  (the  existence  and  uniqueness  will  be  shown 
in  lemma  2)  in  BT€tx  that  has  the  minimum  norm: 

ll£e,J(r)  =  mindly || (Dig  eBT(,x}.  (2.5) 


sin27r(^  —  t) 


\2n2  J-1 


g(s)ds  -  f€(t)\ 


which  is  called  the  MMNS  of  the  continuous-time  band-limited  signal  extrapolation  problem. 
We  now  have  the  following  error  analysis  for  the  above  MMNS. 


Theorem  1.  Let  f€t ^  be  defined  by  (2.6)  with  the  constant  k  =  2c.  If  f  €  BCy  for  some 
number  y  with  0  ^  y  <  j,  then 


\f<Mt)  ~  fU)\  <  C€(,-2>')/3  for  all  t  €  R 


(2.7) 


where  C  is  a  constant  independent  of  €  and  y. 


Before  we  prove  theorem  1,  we  establish  two  lemmas.  We  first  recall  the  following 
known  results  from  operator  theory  of  ill-posed  problems.  Let  Mi  and  H2  be  two  Hilbert 
spaces,  and  AT  be  a  bounded  linear  operator  from  Hi  to  H2.  Let  K*  denote  the  adjoint  of 
the  operator  K  and  K *  be  the  generalized  inverse  of  K  (see  [9, 19,20]).  Let  7l(K*)  denote 
the  range  of  the  operator  K *. 

We  recall  that  the  (Moore-Penrose)  generalized  inverse  K *  of  the  operator  K  is 
characterized  by  the  following  extremal  property.  For  any  g  in  the  domain  V(K^)  = 
7l(K)  +  Ti(K)L,  the  element  K^g  is  the  minimal  norm  least-squares  solution  of  the  operator 
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equation  Kf  =  g.  If  TZ(K)  is  nonclosed,  which  is  the  case,  for  example,  when  AT  is  a 
compact  operator  with  infinite-dimensional  range,  then  the  operator  &  is  unbounded,  so 
the  problem  is  ill-posed.  The  well  known  Tikhonov  regularization  uses  the  approximation 

xa  =  (K*K+aI)-'K*g  a  >  0 

where  1  is  the  identity  operator.  It  is  well  known  that 

lim  x„  =  K^g  for  g  € 

a-*0 

Without  any  ‘smoothness’  assumption  on  K^g,  it  is  not  possible  in  general  to  estimate  the 
rate  of  convergence  of  xa  to  K^g  or  to  obtain  an  error  estimate  ||xa  —  K^g\\  for  fixed  a  >  0. 
In  what  follows  we  will  use  the  following  proposition  (see,  e.g.,  [10, 18])  which  states  that 
if  K' g  e  7Z(K*),  a  kind  of  smoothness  condition,  then  an  error  estimate  holds. 

Proposition  1.  If  K^g  e  7l(K*),  say  K^g  =  K*g*  for  some  g*  e  Hfe,  then 

Let  us  consider  the  operator  F~l  from  L2[-£2,  £2]  to  L2[-T,  T],  a  restriction  of  the 
inverse  Fourier  transform  (1.2),  defined  by: 

=  /(f)  =  t“  f  f(.a))e-i,wdco  t  e  [-T,  TJ.  (2.8) 

I7t  J_a 

Then  its  adjoint  is 

1  CT 

[(F'_1)*g](a))  =  —  J  g(s)eisa>ds  co  €  [-£2,  £2]. 

From  (2.8),  (F~[  f)(t)  =  0  for  almost  all  t  e  [-T,  T]  if  and  only  if  f(co)  =  0  for 
almost  all  co  e  [—£2,  £2].  This  implies  that  the  null  space  Jf(F~l)  of  the  operator  F-1 
is  the  zero  element.  This  also  implies  that  the  space  7^((F-1)*)  is  dense  in  Lz[-£2,  £2] 
since  Closure(W((F-')*))  =  TVfF-1)1  =  L2[-£2,  £2].  Thus  we  have  proved  the  following 
lemma. 

Lemma  1.  For  any  8  >  0,  there  exists  gs  €  L2[-T,  T]  such  that 
11/  -  /all(fl)  < 


where 

Mco)  =  T  J  gs(s)elswds 
and  f  is  the  Fourier  transform  of  f. 

By  lemma  1  and  its  implication  in  the  time  domain,  it  is  clear  that  the  set  BT(X  defined 
by  (2.4)  is  not  empty  when  A  >  e.  Since  the  set  BTf  l  is  closed  and  convex,  we  have 
proved  the  following. 

Lemma  2.  For  X  >  e,  there  is  a  unique  element  g€tX  in  BT(  X  such  that 
llg«ull(r)  =  min{||g||(r)  :  g  e  BT,X }. 
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With  the  function  g€ix  as  in  lemma  2,  define 

ge.x(0>)  =  j  ^ge,. i(s)e“"dj. 

Then  the  MMNS  f(,-A  in  (2.6)  can  also  be  represented  as 


(2.9) 


Ux(t) 


=  —  f 

2  n  J_ 


5*.x(w)e  ,scoda). 


(2.10) 


With  the  signal  fs  in  (2.1),  define 

hit)  =  f  T  Ms)t-',s,ds. 

We  are  now  ready  to  prove  theorem  1. 

Proof  of  theorem  1.  When  /  e  BCy  for  y  >  0,  by  (2.1),  (2.2)  the  signal  gs  with 
<5  =  (2n/V2Q)e  satisfies 


2n 

V2Q( 


where  fs  is  related  to  gs  via  (2.1).  In  the  time  domain,  by  using  the  Cauchy-Schwarz 
inequality  and  the  above  inequality  we  have 


1/(0  -  /i(OI  <  2. 


r if  ia)- hio>))z-'ua>dco 


<  € 


where 


fs((D)e~ua)dco 


m~kL 

(2J)J_  f°  J_  fT 
2 7T  J^q  2tt  J_j 


gs{s)t^s  f)d5do> 
1  fT  sin2?r(5  —  t) 


-hi 


-gi(Ods 


_f  s  —  t 

where  the  convention  Q  =  2;r  made  at  the  beginning  of  this  section  is  used.  By  the 
assumption 

l/<(0“/(0K* 

we  have 

\fs(t)-Mt)\  ^2e. 

According  to  (2.4),  we  have  proved  that  gs  is  in  BT( >2*.  Hence,  by  lemma  2  we  obtain 

\\g(.2*  11(7)  ^  Il5(2*/720)*lla> 

Moreover,  by  (2.1)  and  (2.3),  we  have 

ll^<.2*ll(r)  ^  11^/720)*  11(7-)  <  2nC(2n/V2Q)-y€-y. 


Since 


we  have 


l/*,2*(0-/*(0K2f 

l/*.2*(0-/(0K3e 


t  €  [ -T .  T] 
t  6  [-T,  T], 
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For  the  signal  f$  in  (2.10)  and  considering  (2.2)  in  the  time  domain,  we  have 

J7£i 

\m  -  fit)  |  sj  —5  for  t  e  R. 

Therefore, 

/^Q 

l/e, 2,(0  -  /*(»)!  ^  +  —5  for  t  e  [-T,  T], 

For  a  >  0,  let 


(2.11) 


Jca  =  ((F-1)*^”1  +a/)"1(F"1)*(/et2f(0  “  /«(0). 

By  using  proposition  1  with  K  =  F-1  and  S  =  e,  and  (2.1),  (2.2),  we  have 

|||€,2€  “  /*  “  -^cr  II  (12)  =  II  *f(/€>2c  -  fs )  -  Xa||(Q) 

^  Va(ll^e,2<ll(r)  +  II&5  11(d) 

^  2nC€~Y Va, 

where  C  is  a  constant,  and  g€,2*  ~  fs  ^  K*(g€t2€  —  £a)  from  (2.1)  and  (2.9).  On  the  other 
hand, 

\  _  rygg  /  V2^\  £ 


Thus, 


\Tj2Sil  */2n\  TV2Ci  (. 

I*.!,®  < ;-j—  (3f  +  -JT5)  -  -IT"  (_ 

11,*  -  Mm  «  2,c«->\C+  ^  (s  +  ;• 


2  n  t  a 


Using  (2.2)  with  <5  =  ey  we  have 

!lle,2e  -  /B(O)  <  2nCe~y  */a  + 


tV2Q  /  V2n\  £ 

“V  + 


)«+6- 


In  the  time  domain,  using  the  Cauchy-Schwarz  inequality,  we  obtain 
a/2S2 


l/e.2e(0  - /(Ol  < 


27T 


„  _  Tsfm  /  V2fi\  e 

2nCe~Yy/a  + -  3  +  ——  -  +  e 

n  \  In  f  a 


for  t  €  R. 


Therefore,  estimate  (2.7)  in  theorem  1  can  be  proved  by  taking  a  =  e2(l+>/,/3  and  using  the 
assumption  e  <  1  made  at  the  beginning  of  this  section.  D 


3.  Discretization  of  the  MMNS  method 


Since  in  practice  we  usually  process  discrete-time  signals,  it  is  very  important  to  consider 
the  discretization  of  the  MMNS  method  proposed  in  section  2.  To  do  so,  we  need  some 
notation. 

For  any  number  X  with  X  >  e  and  positive  integer  m,  let  Af/2(2/n  +  1)  denote  the  set 
of  (2 m  +  1 /dimensional  vectors  a  —  (a(A:)}  €  <C2m+1  such  that 


Li  t  sln2I(-.~il°w -/.(-) 

7T2  m  —  —  —  v/fl/ 


2  7T2  m 


k—-m 


Z  X 


for  —  m  <  n  ^  m. 


(3.1) 
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For  X  >  e,  let  =  {z*(fc)}  be  the  unique  element  (the  existence  and  the  uniqueness  will 
be  shown  in  lemma  4)  in  Mlx(2m  +  1)  such  that 

\\zkJ  =  min{||a||;  a  =  (a(*)}  6  Mll(2m  +  1)}  (3.2) 

where 

lla||  =  (  |a(*)|2 

\k=-m 


Finally,  let 


_  1  1 
”  2n2  m  Xj 


sin2jr  (m  “0, 


k=~m 


i-' 


£(*)• 


(3.3) 


Notice  that,  for  a  signal  /  €  BC  and  any  constants  X  >  €  ^  0  and  any  positive  integer 
m,  we  can  always  construct  the  signal  f€tx  in  (2.6)  and  the  signal  'I'*  in  (3.3)  from  the 
given  data  f€(t)  for  t  e  [— 7\  T].  In  other  words,  the  MMNS  /*,*  given  in  (2.6)  and 
its  discretization  4^  in  (3.3)  can  be  found  for  any  /  €  BC  using  its  known  values  on  a 
segment. 

In  practice,  it  is  usually  difficult  to  get  the  MMNS  /6  x  in  (2.6).  A  practical  way  to 
compute  it  is  to  use  the  discretization  form  that  is  formulated  by  'I'*  in  (3.3).  We  have  the 
following  convergence  of  the  discretization  4^  of  the  MMNS. 

Theorem  2.  For  any  constant  X  with  X  >  e,  the  discretization  converges  to  f€tx  uniformly 
on  compact  sets  of  R  when  m  oo. 

It  is  interesting  to  notice  that  the  convergence  result  in  theorem  2  does  not  require  any 
additional  condition  for  a  band-limited  signal  /.  In  order  to  get  an  error  estimation  for  the 
MMNS,  an  additional  condition,  i.e.  /  €  BCyj  in  theorem  1  is  needed. 

To  prove  theorem  2,  we  need  several  lemmas. 


Lemma  3.  For  each  fixed  Xq  >  e,  there  exists  M  >  0  such  that ,  when  m  >  M  and  X  >  Xq, 
the  set  Ml\(2m  +  1)  defined  in  (3.1)  is  not  empty  and  114 II  <  Cx0,  where  Cx0  is  some 
positive  constant  and  independent  of  m  and  X  with  X  ^  Xq. 


Proof.  By  lemma  1,  for  8  =  (X  —  e)/3,  there  exists  gs  6  L2[—  1, 1]  such  that 
where 


/iM 


— l  r 

2W-i 


gtWds. 


Thus, 


In  U 


fs(co)e~ua>  do)  —  f(t) 


<  (A  -  «)/(3 Vn)  for  all  t  e  R. 


In  other  words. 


1 

r 

e--i-  / 

2n  j 

f-2jr 

2tt  J. 

<  (X  —  e)/3  for  all  t  € 


(3,4) 


Since  the  space  of  continuous  functions  is  dense  in  L2[— 1, 1],  there  exists  h&  €  C[—  1,  1] 
such  that 


ll*a-M(i)<(A-€)/3. 


1648 


X-G  Xia  and  M  Z  Nashed 


r2”  l 

A-1  tO)  1 


T-  j  (gs(s)  -  hs(s))e'sai ds  dw  —  J  ^\gs(s)  -  hs(s)\ ds  dco 

<  \/2(X  -  c)/(3j r)  <  (X  -  e)/3  for  all  t  €  R. 


By  (3.4)  we  have 


—  C~i,a>—  f  h,{s)&a&s&u>  -  Mt)  ^  (2X  +  e)/3  for  all  t  €  [-1, 1]. 

2?r  ./_2*  27T 


That  is, 


1  f1  sin27r($  -  0 


5»/- 


/ti(j)dj-/€(0  <  (2X  +  €)/3  for  all  re  [-1,1].  (3.5) 


Since  is  continuous  on  [—1, 1],  the  following  sum 

_Li  V  S[n2n(m~tK  fi\ 

2n2  m  t“_  \m/ 


k=-m  m 


converges  uniformly  to 


1  sin27r(5  —  t ) 


U 


/tj(s)ds 


for  t  e  [-1,  1].  Therefore,  for  (X  —  e)/3,  there  exists  M  >  0  such  that,  when  m  >  M,  we 
have 


LI  V'  sin2;T  (£"«),-  /AI  _  _L  r  Sin2?r(j-m) 

*2»ii  'w  2 W_, 


-/ia(j)ds 


^  (A  —  0/3  for  |n|  <  m. 
Combining  this  with  (3.5),  we  obtain 

_Li  f'  /"I 

Ott2  ™  Z_^  A  _  «  5  \  m 


k=—m  m  m 


(=)-'•© 


<  X  for  all  |n|  <  m. 


Let  a(k)  =  &a(£)  for  |A:|  <  m.  Then,  (a(X)}  e  Mlfom  +  1).  This  proves  that  the  set 
Mlfam  +  1)  is  not  empty  when  m  >  M. 

Moreover,  the  above  M  can  be  large  enough  such  that,  when  m  >  M, 


£>(*) l2<  f  \hs(s)\2ds+l 

X=-m  *'~1 

=  J  |hu_e)/3(5)|2dj  +  1 


Let  (X  -  0/3  <  1  and 


^  (ll£(X— «)/3 11(1)  +  (A.  —  e)/3)2  4-  1. 


2^  ill/2 


Ck0  =  {(llftXo-e)/3ll(l)  +  l)2  +  1} 


Then  lemma  3  is  proved. 
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Similar  to  lemma  2,  since  the  set  Mll(2M+l)  is  closed  and  convex,  we  prove  lemma  4. 
Lemma  4.  For  every  m  and  X  with  X  >  e,  there  exists  a  unique  element 
zm  ~  (^W)  £  +  1) 

such  that 

\\zkJ  =  min{||o||  :  a  =  {«(*)}  €  Mlfam  +  1)}. 

Recall  that  a  family  of  functions  of  a  complex  variable  is  called  a  normal  family  if  every 
sequence  of  the  family  contains  a  subsequence  which  converges  uniformly  on  compact  sets. 
It  is  known  that  a  family  of  functions  that  is  uniformly  bounded  in  any  compact  set  is  a 
normal  family.  We  use  this  result  in  the  proof  of  the  following  lemma. 

Lemma  5.  For  each  Xq  ( >  c),  the  family  of  functions  [^^(t)}^  0>m  defined  in  (3.3)  is  normal 
when  t  is  extended  to  the  complex  plane  C. 

Proof.  The  functions  in  (3.3)  can  be  rewritten  as 

°  "*  k=—m 

=  3-  [*  -  Y]  c,ka/mzk(k))d(o. 

J-2jx  \2nm  £?m  ) 


Thus, 


\K(z)\< 


B23rl*l  f2*  I  1 


Alt2 

e2jr|z|  j  m 


fit* 

J-Tj,  \in  kt?m 

. E 


i k(i)/m  X 


cm 


“T  3i(2=Ltl)'flc^. 

This  proves  that  the  family  {^^x^Xo.m  is  normal. 
Define 


d  ax 


for  X^  Xq 


Z  €  C. 


□ 


=  (3-6) 

k=~m 

Lemma  6.  For  each  X$(>  c)  the  family  is  normal  and  its  limit  functions  are 

1  band  limited . 


Proof.  The  proof  of  normality  is  similar  to  the  proof  of  lemma  5  by  using  lemma  3. 
By  Fatou’s  lemma  and  lemma  3,  it  is  easy  to  prove  that  all  limit  functions  of  the  family" 
nre  i°  when  z  is  restricted  to  the  real  line  R.  Therefore,  by  the 

Paley-Wiener  theorem  (see  [1]),  lemma  6  is  proved.  □ 

Lemma  7.  Let  g(t),  be  as  defined  in  (2.5).  For  a  fixed  e,  let  h(X)  =  ||g€,xll(i)-  Then  the 
function  h(X)  is  continuous  for  X  >  €. 
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Proof.  Let  X0  and  A,  be  any  two  positive  numbers  such  that  X0  >  Xj  >  c.  For  any  X  >  Xu 
define 

Je,x(<w)  =  ^  /  eu<ugf,x(5)  dj. 


Then 


_L  />1 

2tz 


f'  Jsa,gt.x(s)ds  <  e'-'^H^.xlld)  <  ^e'^l^.x.lld)  for  X  >  A., . 

This  implies  that  the  family  {ff,x(<u)}x>x,  is  normal.  Similar  to  lemma  6,  its  limit  functions 
are  1  band  limited.  Let  be  one  of  its  limit  functions.  Let  X(n)  -»  Xj  and  suppose 
that  the  sequence  {g(,x(n)}  converges  to  ht  i0  uniformly  on  compact  sets  of  C.  Then,  there 
exists  /if,Xo  €  L2[-l,  1]  such  that 


jS<F,x0(") 


2  W-i 


By  the  definition  of  /{,x(n)  we  have 

^  ^tag(Mn){w)Aw  -  MO 


=  I  /«.X(.)(0  -  MO  I  <  Hn)  for  t  6  [-1, 1]. 


<X0 


_L  f 

2n  J _2jr 

Let  n  ->  oo  in  the  above  inequality, 

*  I—  r-e--L  [' 

\2n  J-2n  2w 

Thus,  hf  x0  e  BTf  Mr  Therefore, 

Pe,X0ll(l)  ^  ll^f.Xoll(l)- 

On  the  other  hand,  for  any  B  >  0, 

CB  -  fB 

I  l^,x0(«>)l2da>  =  lim  /  |ge,x(n)(a))|2d<u 

J-B  n—too  J _B 

/OO  _ 

lg<.X(n)(fc>)|2dtt>  =  limn  -►oo  ll^e.X(n)  II  (C 

'OO 


for  /  6  [-1,  1]. 


(3.7) 


1  _  2  1  2 
=  ^-linnn^ooll^,X(n) ll(i)  ^  ^ll^€.x0ll(i)- 


Therefore, 


,Xo  ll(oo)  ^  rr — II^Ao  11(1) 

V  27T 


In  other  words, 

ll*«.Xo  11(1)  ^  II^.Xollo)- 

By  (3.7)  and  lemma  2,  we  have  proved  that  =  g€M.  Therefore,  we  have  proved 

lim  h(X)  =  h(X o).  (3-8) 

*-^0+ 

Now  we  want  to  prove  that 

lim  h(k)  =  h(k0). 


(3.9) 
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Let  k\  be  any  positive  with  0  <  k\  <  Ao-  Let  JA(n)}  be  any  sequence  of  numbers  with 
k\  ^  k(n)  ^  k(n  +  1)  <  A0  that  converges  to  A0.  Define 


hn 
Define 

Then 


,  N  ,  Kn)-kx  ,  , 

.*,(*)  +  - —  &r.A<,(j) 

A  0  —  A] 


for  s  €  [-1, 1]. 


(3.10) 


I  Kit)  -  =  |^L  -n^f  t)hn(s)As-  Mt) 

A(«)  —  XA  I  1  sin2n(s  —  t) 

A0-X,  )\2^J. 

A («)  -  A 


<  1 


+■ 


n  j_  [' 

i  2rr2  J_ , 


i 

sin  2nr(,y  —  t) 


geM(s)ds  -  fe(t) 


s  —  t 


I.  A  Hn)  -  *1  \ . Hn)  -  A, 

nll(D'<  I  1 - ^  ^  I  ll&.A.  11(1)  +  — ■_  ,  llgf.Xoll(l)- 


A0-A 

V  *o-A,  / 

This  implies  that  hn  €  BT€tx(n)- 
From  (3.10)  we  have 

II* 

Letting  n  -+  oo  we  obtain 

lim„_ >oo ||/in ||(i)  ^  Hfc.jJ(i). 
Since  we  have  proved  that  /j„  e  BT€Mn)t 

ll^,X(n)ll(l)  ^  \\hn  II (1)* 

This  proves  that 

linin-^ooll<?€,X(«)ll(l)  ^  II^.Xoll(l)* 

On  the  other  hand,  the  following  is  clear: 
ll£ftMn)ll(l)  ^  II^Aoli* 

Thus, 


g€M(s)ds  -  f€(t) 


k(n)  —  Aj 

A,  +  ;■  .  ]Ao  =  k(n). 

Aq  —  A] 


Aq  —  Aj 


llg€.A(n)l|(I)  =  ll^«fJLoll(l) 

/i— ►oo 

that  is,  (3.9)  is  proved.  This  proves  lemma  7.  □ 

We  are  now  ready  to  prove  theorem  2. 

Proof  of  theorem  2.  By  (3.3)  and  (3.6)  we  have 

<(0  =  L  <t>tn(co)c-i'“da>. 

If  we  can_  prove  that  every  limit  function  of  the  sequence  {^)  is  f(ik,  theorem  2  is  proved. 
Assume  h(k  is  a  limit  function  of  the  sequence  {^}.  Without  loss  of  generality,  we  may 
assume  the  sequence  {»!'*}  converges  to  h(  k.  Since  the  family  {'!'*}  for  a  fixed  A  is  normal 
by  lemma  5,  the  convergence  is  uniform  on  compact  sets  of  C.  By  lemma  6,  the  family 
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* 

{</£}  is  also  normal  for  a  fixed  A.  We  may  assume  that  the  sequence  {4>xm}  converges  to  h(,k 
uniformly  on  compact  sets  of  C  and 

h^t)  =  J—  f  he,x(co)c~i,0>dco. 

2 7t  J-2 


-2n 


By  Lemma  6,  there  exists  h€tx  e  L2[— 1,  1]  such  that 
ht.x(co)  =  2.  ['  e™he,x(s)ds. 

2it 

Taking  the  limit  as  m  oo  in 


Km/  '  \m> 

and  using  the  continuity  of  h(tX(t)  and  fe(t)  for  t  €  [—  1, 1],  we  obtain 

|A£l*(0  -  /«(0I  <  *  »  6  [-1, 1]. 

This  proves  that  6  BT e,x.  Thus, 


(3.11) 


We  next  want  to  prove  the  reverse.inequality. 

For  A  >  e,  choose  fi  such  that  A  >  /a  >  e.  For  this  /a.  we  have  €  BT(  x-  Using 
the  same  argument  as  jn  the  proof  of  lemma  3,  for  (A  —  /x)/3  there  exists  6  U[— 1,  1] 
such  that 

A  —  n 


Thus,  if  we  let 


then. 


1  /'l  sin2;r(j  -  r)  . 


-J2k  —  fl  .  r  i  n 

I«.,m«  - /«.m(OI  < - ^  for  re  [-1,1]. 


jr  3 

Therefore,  there  exists  M  >  0  such  that  when  m  >  Af  we  have 


1  I  A  sin 2;r  (£-*),  (k 


2;r2 


k——m 


k_  _  n_ 
m  m 


8e,ti 


(=)-*(=) 


2A±£<;i_ 


By  (3.1),  this  implies  that  g(tli  =  {g£,/i(^:)}  €  Ml\(2m  +  1).  Therefore, 
\\9e.n  II  ^  11^  II- 


Thus 


Therefore, 


limm_^oo"-|l^mli2  ^  limm-+oo“  II^.M  I!2  —  II^€,mH(1)* 
m  m 


✓ _  !  \!/2 

nim^cc-llz^H2)  <  llgf.^lld) 


+ 


X  —  fi 


(3.12) 
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On  the  other  hand,  for  any  B  >  0, 


/  |Af.x(«)|2  =  Km  f  \<p^((o)\2dw 
J-B  m-tco  J _g 

/•nm 

»  /  \4>km((o)\2d co 

J  —nm 

/nm  /  -1  m  \  /  i  m  _ \ 

( —  y  eka,,mzi{k) )  ( y—  y  t-'kaimziik) 

™  \2lrm  k=--m  )  \2nm  k~m  ) 

/nm  j 

...  (2^ki(t,|!d" 

=  S  lzi<i)l2. 


^  lim„ 


(3.6)-p — 

—  Iimm_>oo 


dco 


k=—m 


Therefore, 


Since 


we  have 


By  (3.12), 


l^lljoo)  <  2-lim«^oo-  y  kmWI2- 

Z7T  m  r — ' 


k~—m 


ll^ll?,)=27rll^ll(oo) 


ll^f.ill(i)  ^  Kmm-+oo  J  '  |zm (*)| 


k=—m 


11(1)  ^  ll^./zllO)  + 


k  —  p 


Letting  p  ->  k,  by  the  continuity  of  h(k)  on  (6,  oo)  in  lemma  7,  we  have 

By  (3.1 1),  we  have  proved  that 

»A«.Jt  lid)  =  ll^xll(i). 

Since  h(<\  e  BT€^  hy  lemma  2,  we  have 

h€,x(s)  =  g€tx(s)  for  s  e  [-1, 1],  almost  surely. 

This  proves  that  vp*  converges  to  f€tx  as  m  — ►  oo. 


□ 


4.  Band-limited  signal  spaces  B£7 

The  error  estimate  result  in  theorem  1  is  for  band-limited  signals  in  the  spaces  BCy.  The 
conditions  in  (2. 1>— (2.3)  defining  these  spaces  are  rather  abstract.  In  this  section,  we  study 
their  properties  and  simplifications.  To  do  so,  let  us  first  review  the  prolate  spheroidal 
wavefunctions  (see  [25,29,30]). 

Let  K  be  the  following  operator 

=  f  . --■/(*)  dr  /  e  L2[-T,  T].  (4.1) 

J-T  —  *) 
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It  is  clear  that  the  operator  K  defined  on  L2[— 7\  T]  is  self-adjoint  and  compact.  Let  fa  and 
X*,  k  =  0,  1, 2, . . . ,  be  the  eigenfunctions  and  the  corresponding  eigenvalues  of  the  operator 
K,  respectively,  such  that  <pk,  k  =  0, 1, 2, . . . ,  form  an  orthogonal  basis  for  L2[-T ,  T]  with 

f\(t)M)dt  =  XkS(j-k) 

where  S(n)  =  1  when  n  =  0  and  8(n)  =  0  otherwise.  Moreover,  we  have 

1  >  A-o  >  Xi  >  •  •  •  >  0  and  A.*  ->  0  as  k  -+  oo.  (4.2) 

From  (4.1), 

<pk(t)  =  -  [T  SinQ(f  ~T)^(t)  dr  t  e  [-T,  T]  k  =  0, 1, 2 .  (4.3) 

h  J_T  7t(t  -  T) 

Although  the  above  eigenfunctions  </>*  are  only  defined  on  the  interval  [—  T,  7"],  they  can  be 
easily  extended  to  the  whole  real  line  R  by  letting  t  take  an  arbitrary  real  value  in  formula 
(4.3).  By  doing  so,  it  was  proved  in  [29, 30]  that  the  extended  eigenfunctions  <f>k  for  t  e  R 
have  the  following  orthonormality: 


<pj(t)<pk(t)dt  =  S(j  ~k). 


These  extended  eigenfunctions  <pk  are  called  the  prolate  spheroidal  wavefunctions  in  [29, 30]. 
It  was  also  proved  in  [29, 30]  that  these  prolate  spheroidal  wavefunctions  <pk,  k  =  0, 1, 2, . . ., 
form  an  orthonormal  basis  for  the  Q  band-limited  signal  space  BC.  Thus,  any  f  €  BC  can 
be  expanded  as 


where 


and 


00 


fit)  =  ^a*d>*(f) 

k=0 


t  e  R 


ak=  f°°  f(t)M)dt  = 

J  —oo 


~~  C  f(t)Mt)dt 

*-k  J —T 


11/11(00)  = 


oo 


(4.4) 


(4.5) 


(4.6) 


and 

ll/I(T)  -  t,4h.  (4.7) 

k= 0 

We  now  have  the  following  result. 


Theorem  3.  Let  f  be  an  Q  band-limited  function  and  have  the  expansion  (4.4),  (4.5).  If  - 


E  |  -2y/3  <  00  f°r  SOme  y'  0  ^  y  <  \ 

k=  0  h 


then  f  €  BCy. 
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Proof.  For  A  >  0,  let  DA  be  the  truncation  operator  on  L2(R):  for  h  e  L2( R), 


h(t)  t  e  [—A,  A] 

0  otherwise. 


(DAh)(t)  = 

By  (4.4)  and  (4.5),  letting  F  denote  the  Fourier  transform,  we  obtain 

oo 

f(co)  =  Ff{co)  =  akFM<o ) 

oo 

=  J2^DnFDT4>k(t)/h 


Let 


Then 


and 


and 


Let 


Then 


and 


Let 


k= 0 


oo 

=  DnT±FDTM). 


/n  =  E0*^*  =  Ai  ]E  ^  FDT<pk . 


k=  0 


Jfc=0 


/n  =  E 


*=0 


ll/»-/ll(oo)=  E  fl* 


*=/l+l 


II /n  -  /life)  =  1*  E  fl* 

£=n+I 


/„  =  dqf 


g„  =  2nDj  ~<t>k- 

*-k 


k= 0 


fn  =  D^FSn 


"  a? 


Ilia  life)  —  Ellll^ll'r)  -  E  j T- 
*=0  *=0  A* 


°k  -  I-2K/3 
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Then  by  the  assumption 

OO 

k= 0 

we  have 


<  OO 


ii  ii(r) — yykh 


2\  “2y/3 


*= 0 


and 


II /.  -  /11(D)  =  2*  E  blk 


2l  1  -2y/3 
* 


*=n+l 


By  (4.2),  for  any  5  >  0,  there  exists  N  such  that 


,1-2x73 


for  k  >  AT  4*  1 


and 


Then 


and 


k[-2yn  >8  for  k^N. 


II  fN  ~  film  <  2n  £  J#  ^  2nB8 

k=N+\ 


ll^ll(r)<E^“T^5 

k= 0 

For  0  ^  y  <  5>  ^ere  exists  a  constant  C  >  0  such  that 


ayllgwll?D<c- 


Let 


Then 


and 


1  17 

fy/2Bnl  —  2 x^SN- 


A  A 

/n  —  E>nfy/2l&8 

Wfjim-hw)  *kJ2B^s 


(yiBnsJ  II ll(oo)  =  (V2fljra)  -^==  II^atIUd 

^  By/2(2n )(y-V/28Y/2 Hg#  ll(r) 

<  B>'/2C1/2(2n-)(>,_1)/2  for  0  <  y  < 

This  proves  that  /  satisfies  (2.1) — (2.3). 


□ 
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Before  going  to  the  next  result,  we  recall  a  result  on  operator  equations.  Suppose  that  K 
is  a  compact  linear  operator  from  Hilbert  space  Hi  to  Hilbert  space  H2.  Let  0,2  ^  Q\  ^  •  •  • 
be  the  sequence  of  eigenvalues  of  the  operator  K*K,  and  vi,U2. ...  be  the  associated 
orthonormal  eigenfunction  sequence.  Let  pn  =  8~l  and 

un  =  finKvn.  (4.8) 


Then  {«„}  is  an  orthonormal  sequence  in  H2  and 

v„  =  p.nK*un.  (4.9) 

We  call  the  sequence  {un,  v„\  p„}  a  singular  system  for  the  operator  K.  Then,  Picard’s 
theorem  can  be  stated  as  follows  (for  details,  see,  for  example  [10,  20]). 

Proposition  2.  Let  K  :  .Hi  ->  H2  be  a  compact  linear  operator  with  singular  system 
{«„,  u„;  n„).  In  order  that  the  equation  Kz  —  g  has  a  solution,  it  is  necessary  and  sufficient 
that  g  6  Ker(^'*)1(=  Closured  AT))  and 

<  00 

n=0 

where  ( , )  is  the  inner  product  on  H2. 


We  now  have  the  following  result. 


Theorem  4.  Assume  that  f  is  £2  band  limited  and  with  expansion  (4.4),  (4.5),  Then : 

(i)  f  e  BCy  with  y  =  0  if  and  only  if  its  Fourier  transform  f(co)  or  -f(-co)  for 
a)  e  (— £2,  £2)  is  a  piece  ofT  band-limited  signal ; 

(ii)  f  €  BCy  with  y  —  0  if  and  only  if 


<  co. 


Proof  of  (i).  ‘If  part':  If  -f(-co)  for  co  e  (-£2,  £2)  is  a  piece  of  T  band-limited  signal, 
then  there  exists  g  6  L2[-T ,  T]  such  that 

1  fT 

f(co)  =  —  /  clS0Jg(s)ds  co  e  (-£2,  £2). 

2n  J _7* 

For  any  8  >  0,  let  gs  =  g.  Then,  fs(co)  =  f(co)  for  co  €  (-£2,  £2).  Let  C  =  2~||g||(7> 
Then 

ll/«“/ll(Q)=0<* 


and 

\\fs\U00)  -  ^Lll&lkn  =  ^=ll*ll(n  =  C. 

Thus  f  e  BCy  for  y  —  0. 

‘Only  if  part’:  If  f  e  BC$,  then  for  every  8  >  0  there  exists  gs  e  L2[~T ,  T]  such  that 
ll/j-/U(Q)<*  and  \\gs\\(T)  ^  2jtC 
where  C  is  a  constant  and 

1  rT 

fsico)  =  —  j 
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Thus,  the  function  family  {/a}  is  normal.  In  fact, 

\Mz)\  <  ^'z'T\\gs\Ur)  <  cV2Te™  for  all  S  >  0  z  €  C. 

l7t 

Therefore,  for  every  sequence  {<$„}  that  tends  to  0  when  n  ->  oo,  there  is  a  subsequence 
}  such  that  {/*„  }  converges  to  a  T  band-limited  signal  h  uniformly  on  compact  sets  of 
C.  On  the  other  hand, 


2n  J_s 


e-i,a,A .(®)d®  -  fit) 


*  ~^r nr 


Letting  j  -*■  oo,  we  obtain 


1  rn 
2n  J_a 


&~i,0>h(a>)dco  =  f{t). 


2n  j_n 

This  proves  h(co)  =  f{w)  for  w  e  (-J2,  f2),  that  is  /  is  a  piece  of  a  T  band-limited  signal. 

Proof  of  (ii).  Let  Hi  =  L2[-T,  T]  and  H2  =  BC0.  The  inner  product  on  H2  is  the 
usual  I2(R)  inner  product.  Let  K  be  the  integral  operator  given  in  (4.1).  By  part  (i), 
K(L2[-T,  71)  =  BCq.  By  theorem  3,  all  finite  linear  combinations  of  the  eigenfunctions 
<pk  are  in  BC0.  Thus,  Closure(S£0)  =  BC  and  therefore,  BC  =  Closured  AT)),  where  the 
closure  is  under  the  usual  L2(R)  norm.  Also, 

/T/(0=  f  Smf  -  ~—/(j)dJ  for  /  e  BCq. 

J-O O  tr(s  - 1) 

From  (4.8)  and  (4.9), 

Un  =  dlKK*un. 

Hence,  {/x2}  are  eigenvalues  of  the  operator  KK*  and  {«„}  are  the  corresponding 
eigenfunctions.  Since 

*  sin  &(s  —  0 

K'4>n(t)  =  /  - T- - T—<t>n(s)dS  =<t>nit) 

J —00  \S  t) 

we  have 

KK*<f>n  =  K<t>n=kn<l>n. 

Thus  by  the  completeness  of  the  sequence  {<pn}  we  have 
kn  =  fi~2  and  <pn=Un- 
By  proposition  2, 

00  00  a2 

feBC0  iff  y'A.;,|(/,0n)l2  <  00  iffHr-<0°- 

n= 0  n= 0  A" 

This  proves  (ii). 

Combining  theorems  1,  3  and  4,  we  have  the  following  corollaries. 

Corollary  1.  For  0  <  y  <  j,  if 


□ 


OO 


00  a2 


then. 


fit)  =  J2ak<Pkit)  anti  .1-2F/3  < 

k-0  *= 0  *k 

I/«a(0  -  /(Ol  <  Ce(l~2y)/3  t  €  R. 


OO 
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Corollary  2.  Let  f  be  Q  band  limited.  If  its  Fourier  transform  f(co)  for  co  €  (— £2,  £2)  is  a 
piece  of  a  T  band-limited  function ,  then 

l/c.*(0  -  /(»)!  ^  Ccl/3  t  e  R. 


5.  Remarks 


In  [14, 17],  approximations  of  S2  band-limited  signals  /  are  considered.  These  authors  use 
finite  data  of  /  on  [-T,  T]  to  recover  the  whole  /  on  [-7\  T].  The  optimal  algorithm  in 
the  worst  case  for  the  recovery  has  been  found  in  [14, 17]  as  follows. 

Let  Om  be  an  information  operator  which  is  a  mapping  Om  :  BC  — ►  Cm, 

Omf  =  (/(/.).  /(*2),.... /««))• 

An  algorithm  $  is  a  function- valued  mapping  on  OmBC.  The  optimal  algorithm  using  Om 
in  the  worst  case  takes  the  form: 

•  —  h 

where  the  coefficients  b\, &2, . . . ,  bm  are  determined  by  the  solution  of  the  linear  system 


sin £l(tn  —4) 
k  ln~  tk 


=  fUn) 


n  =  1, 2, . . . ,  m. 


We  can  see  that  this  is  similar  to  the  discretization  of  the  MMNS  in  (3.1H3.3). 

As  we  have  already  stated,  a  band-limited  signal  is  the  restriction  of  an  entire  function 
to  the  real  line.  But  it  is  more  than  this.  The  Paley-Wiener  theorem  (see  [1])  gives  a  direct 
characterization  of  band-limited  signals;  namely,  a  signal  in  L2(R)  is  In  band  limited  if 
and  only  if  it  is  the  restriction  of  an  entire  function  and  is  of  exponential  order  on  the 
real  line.  This  provides  a  powerful  property  for  extrapolation  of  band-limited  signals  that 
distinguishes  the  problem  within  the  realm  of  analytic  continuation  of  analytic  functions, 
and  makes  finer  and  stable  recovery  results  possible. 

There  is  considerable  literature  on  uniform  and  nonuniform  sampling  theorems  for  the 
recovery  of  band  limited  and  other  classes  of  signals  from  a  countable  set  of  sample  values 
(see  [2,  3,  12,  23,  37]),  the  simplest  and  most  celebrated  version  being  the  Shannon- 
Whittaker  theorem,  which  asserts  that  a  n  band-limited  signal  can  be  reconstructed  via  the 
cardinal  series 


f(0  =  £  fin) 

n~—oo 


sin7r(f  —  n) 
n(t  -  n) 


Various  error  estimates  (truncation,  jitter,  amplitude,  and  aliasing  errors)  are  also  known. 
The  problem  of  signal  extrapolation  from  an  interval  (which  usually  has  a  small  length)  is 
markedly  different  from  the  reconstruction  of  the  signal  /  via  a  sampling  expansion  theorem 
(which  utilizes  values  of  /  on  an  appropriate  infinite  sequence  with  no  accumulation  point). 

As  we  have  shown  BCo  is  the  range  of  the  Hilbert-Schmidt  compact  linear  operator 
(4.1)  on  L2[-T,  T).  BCo  is  nonclosed  in  L2[-T,  T],  Nashed  and  Wahba  [21,22]  have" 
.shown  that  the  range  of  a  Hilbert-Schmidt  compact  operator  AT  is  a  reproducing  kernel 
Hilbert  space  (RKHS)  Hq  with  reproducing  kernel 

Qit,s)  =  J  K(t,u)K(s,u)du 
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where  K(t,  u)  is  the  Hilbert-Schmidt  kernel.  The  inner  product  on  Hq  is  given  by 
(/,,  f2)Q  =  {K'fu  K]h)  for  fu  fi  in  Hq,  where  is  the  Hilbert  space  (Moore-Penrose) 

generalized  inverse.  Equivalently, . 

</i./2>e  =  J  TPi(s)p2(s)ds 

.where  pt  is  the  element  of  the  minimal  norm  which  satisfies  Kp  =  fh  corresponding  to  ft 
in  BC o  for  i  =  1, 2.  We  recall  that  a  Hilbert  space  H  of  functions  f  on  an  interval  J  is  said 
to  be  a  RKHS  if  all  the  evaluation  functionals  E,(f)  =  f(t),  f  €  El,  for  each  fixed  t  6  1, 
are  continuous.  Then  by  the  Riesz’s  representation  theorem,  for  each  t  €  J,  there  exists  a 
unique  element,  call  it  Q„  in  H  such  that  f(t)  =  {/,  Q,),  f  €  H,  where  ( , )  is  the  inner 
product  on  EL  Let  Q(t,  s )  =  ( Qs ,  Q,)  for  s,  t  in  J;  this  is  the  reproducing  kernel  (RK)  of 
H,  and  the  space  M  with  RK  Q(t,  s )  is  denoted  by  He.  The  space  L2( J)  is  not  a  RKHS. 

The  Paley-Wiener  space  BC  of  band-limited  signals  with  band  [-;r,  n]  is  a  RKHS  with 
RK 


Q(t,s) 


sin^(r  —  s ) 


In  [23]  it  is  shown  that  there  is  a  strong  affinity  between  RK  Hilbert  spaces  and  sampling 
theorems,  and  general  sampling  theorems  were  established  for  signals  belonging  to  a  RKHS 
which  is  also  a  closed  subspace  of  the  Sobolev  space  ET1.  The  preceding  remarks  about 
BC0  and  the  other  related  spaces  being  RKHS  may  suggest  that  a  broader  framework  within 
which  the  type  of  extrapolation  results  derived  in  this  paper  may  also  hold. 
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A  Quantitative  Analysis  of  SNR  in  the  Short-Time 
Fourier  Transform  Domain  for  Multicomponent  Signals 

Xiang-Gen  Xia 


Abstract — A  quantitative  analysis  is  given  for  the  signal-to-noise  ratio 
(SNR)  in  the  short-time  Fourier  transform  domain  for  multicomponent 
signals  in  additive  white  noise.  It  is  shown  that  the  SNR  is  increased 
on  the  order  of  0{N/K),  where  K  is  the  number  of  components  of  a 
signal,  N/T  is  the  sampling  rate,  and  T  is  the  window  size.  The  SNR 
increase  rate  is  optimal  for  given  A\  For  this  result,  the  SNR  definition 
is  generalized,  which  is  suitable  for  signals  not  only  in  the  time  domain 
but  also  in  other  domains.  This  theory  is  illustrated  by  one  numerical 
example. 


L  Introduction 

Time-frequency  analysis  [1 1J— [12]  has  become  an  important  tech¬ 
nique  in  analyzing  wideband/nonstationary  signals  in  various  applica¬ 
tions  including  inverse  synthetic  aperture  radar  (ISAR)  imaging  [1], 
biomedical  signal  analysis  [2]— [3],  speech  signal  analysis  [4],  and 
FM  radio  communications  [5].  One  of  the  most  important  features 
of  this  technique  is  that  it  usually  increases  the  signal-to-noise  ratio 
(SNR)  in  the  joint  time-frequency  (TF)  domain.  This  is  particularly 
advantageous  for  signals  that  are  difficult  to  detect  in  the  time  or 
frequency  domain  alone.  The  reason  for  this  important  feature  can 
be  stated  as  follows.  A  joint  TF  transform  usually  spreads  noise 
from  one  dimension  (the  time  or  frequency)  into  two  dimensions 
(the  joint  time  and  frequency)  while  it  usually  concentrates  a  signal 
in  localized  regions  in  the  TF  plane.  A  number  of  research  results 
on  the  estimation  of  time-varying  frequencies  have  appeared,  such  as 
[5H7]  with  Wigner-Ville  distributions.  However,  there  is  little  on 
quantitative  analysis  for  the  SNR  increase  for  any  joint  TF  transform, 
which  is  certainly  an  important  issue  in  practical  applications  in  signal 
detection  by  using  thresholding. 

In  the  conventional  SNR  definition,  the  mean  power  is  taken  over 
the  whole  domain  of  a  signal.  If  the  signal  is  stationary  in  this 
domain,  this  definition  works  fine.  However,  if  the  signal  is  not 
stationary  in  this  domain,  such  as  a  single  tone  signal  in  the  frequency 
domain,  this  definition  is  no  longer  suitable.  In  this  correspondence, 
we  first  generalize  the  SNR  definition  so  that  it  is  not  only  suitable 
for  signals  in  the  time  domain  but  also  in  other  domains,  such  as 
the  frequency  domain  and  the  joint  TF  domain.  We  then  present  a 
quantitative  analysis  of  the  SNR  increase  rate  in  the  joint  TF  domain 
for  the  short-time  Fourier  transform  with  rectangular  windows,  where 
multicomponent  signals  in  additive  white  noise  are  considered.  The 
main  result  can  be  stated  as  follows. 

K  number  of  monocomponents  in  a  signal; 

T  window  size  for  the  short-time  Fourier  transform; 

N/T  sampling  rate. 
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Fig.  1.  Single  tone  signal. 


jV -point  discrete  Fourier  transform  is  performed  in  each  window. 
Then,  the  SNR  in  the  joint  TF  domain  is  increased  on  the  order  of 
0(N/K)  when  the  window  size  T  is  small  enough. 

This  correspondence  is  organized  as  follows.  In  Section  II,  we 
formulate  a  proper  definition  for  SNR  in  different  domains.  In 
Section  m,  we  present  the  proposed  quantitative  approach  to  analyze 
the  SNR  increase  rate  in  the  joint  TF  domain.  A  numerical  example 
is  presented  in  Section  IV  to  illustrate  the  proposed  approach. 

n.  SNR  in  Different  Domains 

The  conventional  signal-to-noise  ratio  (SNR)  is  defined  as  the  ratio 
of  the  mean  power  of  the  signal  over  the  mean  power  of  the  noise, 
where  the  mean  is  taken  over  the  whole  time  domain.  It  is  formulated 
as  follows.  Let  y[rc]  be  a  distorted  signal 

y[n]  =  x[n]  +  t ;[n],  0  <  n  <  N  -  1  (2.1) 


where  x[n]  is  a  signal,  and  T)[n]  is  an  additive  white  noise  with 
variance  a2.  The  SNR  is  defined  as 


SNR  = 


En-i 

n-0 


M»Ha 


N< 72 


(2.2) 


This  SNR  is  used  quite  often  in  describing  the  noise  level  relative 
to  the  signal  and  in  distinguishing  the  signal  from  noise  in  stationary 
environments.  When  the  SNR  is  too  low,  in  general,  it  is  impossible 
to  distinguish  the  signal  x[n]  from  y[n].  However,  for  some  special 
kinds  of  signals  x[n],  such  as  narrowband  signals,  it  is  possible 
to  detect  the  signal  in  the  Fourier  transform  domain,  even  when 
the  SNR  is  of  negative  decibels.  An  example  is  shown  in  Fig.  1, 
where  the  SNR  =  -11  dB  and  the  signal  x  is  a  single  tone 
signal. 

According  to  the  SNR  definition  in  (2.2),  an  orthogonal  transform 
does  not  change  the  SNR,  i.e.,  the  SNR  in  the  transform  domain  is 
exactly  equal  to  the  SNR  in  the  time  domain.  This  is  because  of  the 
energy  preservation  property  of  orthogonal  transforms.  This  implies 
that  the  SNR  of  the  signal  in  the  frequency  domain  in  Fig.  1(b)  is  still 
-11  dB.  However,  one  can  clearly  see  the  signal  in  the  frequency 
domain.  This  suggests  that  the  SNR  definition  in  (2.2)  is  not  proper  to 
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judge  the  possibility  of  detecting  the  signal  in  the  frequency  domain 
in  Fig.  1(b).  It  should  not  be  surprising  since  the  signal  in  Fig.  1(b)  is 
not  stationary,  and  the  mean  power  over  the  whole  frequency  domain 
is,  of  course,  not  proper  to  the  signal  with  a  single  spike. 

The  above  observation  suggests  that  the  SNR  definition  is 
transform-domain  dependent  and  should  relate  to  the  bandwidth 
of  a  signal  occupied  in  that  domain.  We  now  introduce  the  following 
SNR  definition  in  a  domain. 

Suppose  the  expression  (2.1)  is  already  in  a  transform  domain, 
where  n  is  the  discrete  variable  in  the  transform  domain.  Assume 
the  additive  white  noise  j/[n]  in  (2.1)  occupies  the  full  band  in  the 
transform  domain.  For  the  signal  at[n]  of  length  N,  0  <  n  <  N—  1,  let 

B={n:0<n<iV-l  and  |x[n]|2  >  0.5  ^max^  |*[n]|2  j 

(2.3) 

where  the  number  0.5  comes  from  the  common  3-dB  bandwidth 
definition  in  communications.  Then,  the  SNR  is  defined  as 

SNR  A  Engs  NnH!.  (2.4) 

^  \B\ 

where  \3\  denotes  the  cardinality  of  the  set  B.  Notice  that  this 
definition  is  similar  to  the  SNR  definition  in  communications,  where 
the  signal  is  only  considered  in  its  bandwidth. 

One  can  dearly  see  that  the  SNR  in  (2.4)  is  always  greater  than  or 
equal  to  the  SNR  in  (2.2)  because  the  mean  in  (2.4)  is  only  taken  over 
the  first  large  values  in  the  whole  domain.  With  the  SNR  definition  in 
(2.4),  the  SNR  in  the  time  domain  for  the  signal  in  Fig.  1(a)  is  —8.4 
dB,  but  the  SNR  in  the  frequency  domain  for  the  signal  in  Fig.  1(b) 
is  16.3  dB.  Although  about  2.6  dB  SNR  is  increased  over  the  original 
definition  in  (2.2),  the  SNR  in  the  frequency  domain  is  significantly 
better  than  the  old  SNR,  that  is,  -11  dB,  in  describing  the  signal 
characteristics  over  the  noise.  The  time  domain  SNR  increase  is 
consistent  for  relatively  stationary  signals  without  dramatic  jumps 
in  the  time  domain. 

III.  SNR  in  the  Joint  TF  Domain 

In  this  section,  we  analyze  the  SNR  in  the  joint  TF  domain  for  the 
short-time  Fourier  transform,  where  the  SNR  defined  in  (2.4)  is  used. 
In  order  to  do  so,  we  first  describe  a  multicomponent  signal  model. 

A.  Multicomponent  Signal  Model 

Throughout  the  rest  of  this  paper,  we  use  the  following  multicom¬ 
ponent  signal  model: 

K 

y(t)  =  £  Xk(t)  +  77(f),  0  <  f  <  To  (3.1) 

k=  l 

where  we  have  the  following  assumptions 

1)  f  is  the  continuous-time  variable  and  limited  in  the  finite 
observation  interval  [0,  To]. 

2)  77(f)  is  an  additive  white  noise  process  with  mean  0  and 
variance  <x2.  It  is  not  differentiable  at  any  time  f  €  [0,  To] 
and  independent  of  xk(t),  1  <  k  <  A. 

3)  For  each  fc,l  <  k  <  A',  xk(t)  is  a  monocomponent  time- 
varying  signal,  i.e., 

xk(t)  =  Ak(t)eJ*M  (3.2) 

where  Ajt(f)  is  the  slowly  varying  amplitude  envelope  of 
x*(f),  and  <Mf)  is  the  phase  of  xk(t).  The  magnitude  of 
the  first  order  derivative  A*(f)  is  upper  bounded  by  A*,  i.e., 
|Ajt(f )|  <  Ah  for  a  positive  constant  Ak ,  and  the  magnitude  of 


the  second-order  derivative  <t>k(t)  is  also  upper  bounded  by  <£*, 
j.e.,  \4>k(t)\  <  4>k  for  a  positive  constant  <j>k  for  all  f  €  [0,To]. 

4)  The  K  instantaneous  frequencies  4>k(t)>  1  <  fc  <  A,  are 
distinct 

Additional  details  on  multicomponent  signals  can  be  found  in  [8]. 
It  can  be  easily  shown  that  the  process  57(f)  in  (3.1)  has  locally 
stationary  behavior  [93— [10]  in  the  following  sense: 

|Ayy(f  +  U,  S  +  U)  -  Ryy  (f,  S)  \  <  C\u\  (3.3) 

for  a  positive  constant  C,  where  Ryy  denotes  the  autocorrelation 
function  of  57(f). 

As  a  remark,  the  nondifferentiability  assumption  2)  of  77(f)  makes 
sense.  An  example  of  such  processes  is  the  Wiener  process;  see,  for 
example,  [13].  This  assumption  implies  that  any  sampled  segment 
of  77(f)  in  any  time  interval  is  a  white  noise  and  has  flat  Fourier 
spectrum. 


B.  Short-Time  Fourier  Transform  for  Multicomponent 
Signals  and  SNR  Calculations 

For  each  monocomponent  signal  x*(f)  in  (3.1),  by  l)-3),  it  can  be 
shown  that  there  exists  e*  >  0  such  that  for  any  s  G  (e*,  To  -  ek) 

xk(s  +  t)  «  Ak(s)ei(*k(‘)+<(,)t},  t  € 

where  the  linear  term  Ak(s)t  of  f  does  not  appear  because  of  the 
“slowly  varying”  assumption  in  3)  on  the  amplitude  envelope  Ak (f )• 
Since  we  have  only  finite  many  monocomponent  signals  xk  (f)  in 
(3.1),  there  exists  e  =  min{ejt,  1  ^  h  <  K}  >  0  such  that  for  any 
s  €  (e,  To  -  €)  and  any  fc,  1  <  k  <  K 

xk{8  +  f )  *  Ak(sy^+<(s)t\  t  €  [-6, 6]  .  (3.4) 

where  e  depends  on  the  constants  To,  Ajt,  and  1  <  h  <  A . 

With  (3.4),  at  each  times  €  (e,  To  -  €),  we  apply  Appoint  discrete 
Fourier  transform  (DFT)  for  the  signal  y(t )  for  t  G  (s  -  § ,  s+  y  ]  with 
the  sampling  rate  N/T  for  T  =  2c.  For  convenience,  we  assume  that 
N  is  even.  The  DFT  is 

1  W?  /  T\  joi 

£  vll"**)#)*--*- 

ViV  q—  —  N/2+l  X  7 

0  <  l  <  N  —  l  (3.5) 


where  m  is  in  the  range  such  that  (m  —  N/2  +  l)T/iV  >  0,  and 
(m  +  N/2)T/N  <  To,  i.e., 


N  —  2 
2 


The  above  Py  can  be  decomposed  into 


A 

PyK  /]  =  53  7**  *]  +  fl 

0  <  1  <  N  -  1,  <m< 


(3.6) 


where  PXk [m,  /]  and  Ptj(m,  I]  are  defined  for  xk(t)  and  J/(0- 

1  ^  /  T\ 

P:klmJ]  =  -7=  E  **((m  +  «)w)e  ^ 

VJV  ,=-N/2+l  X  ' 

0<1<N-1  (3.7) 

1  /  T\  ZxiaL 

Pr,[m,l)=7=  E  vUm  +  q^y  -#~ 

*ly  q=-N/ 2+1  V  ' 

0<1<N-1.  (3.8) 
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Since  77(f)  is  a  white  noise  process,  for  each  m,  the  Fourier  spectra 
E(\Prj[mJ]\2)  are  flat  over  the  whole  frequency  domain  0  <  /  < 
N  —  1,  as  mentioned  in  Section  HI- A.  This  implies  that  the  mean 
power  of  the  noise  spectrum  P^[m,  /]  is  also  a2,  which  is  the  same 
as  in  the  time  domain. 

We  next  want  to  study  the  mean  power  of  PXk  [m,  /]  for  the  signal. 
Using  (3.4) 


1  '  (  T\ 

q=-N/ 2+1  X  7 

y/N  k\  NT 


N/2 

E  ■ 

2+1 


*£)T-2,rI 
- . 


Therefore 

|PZjk[mJ]|*|AJt 


(3.9) 


By  the  assumption  of  distinct  instantaneous  frequencies  <£*(m^)  for 
1  <  /:  <  A',  the  Fourier  power  spectra  \PXk  [m,  /]|2  are  located  at  K 
different  frequencies  4>>k(my-)T/(2n)^l  <  k  <  A\  This  implies 


K 


k=  1 

sxN 


»4H4)K'-“)-  <3io> 

Therefore,  for  each  fixed  time  s  =  in  the  frequency  domain 

Ei ft.Kij  ^Eh(4)|-  (311) 


max 
0  <  /  <  .v  —  1 


Now,  let  us  come  back  to  the  time  domain  signal  y(mjj).  The  noise 
mean  power  is  a2 .  The  signal  power  at  each  time  t  =  is 

K  |  /  rp  \  2 

^TW-v  •  <312> 

Jk=l  1  '  ' 


the  constant  in  the  SNR  increase  rate  0(N/K) 


Fig.  2.  SNR  increase  rate. 


Theorem  1:  For  a  multicomponent  signal  with  K  many  monocom¬ 
ponents,  the  SNR  in  the  joint  TF  domain  with  the  short-time  Fourier 
transform  with  the  rectangular  window  of  size  T  and  the  sampling 
rate  N/T  increases  over  the  SNR  in  the  time  domain  on  the  order  of 
0(N/K)  when  the  sampling  rate  is  fast  enough.  Given  the  number 
A',  this  increase  rate  O(NJK)  is  optimal. 

Proof:  The  first  part  has  been  proved  by  the  above  argument 
The  optimality  can  be  proved  by  taking  Ak(t)  =  1  and  <t>k(t)  =  cut2 
for  proper  constants  Ck  #  0  for  1  <  k  <  K  and  noticing  that  the 
inequalities  in  (3.9M3.12)  become  equalities  in  this  case.  □ 


IV.  Numerical  Example 

For  simplicity  in  computations,  we  choose  the  following  two- 
component  signal  model: 

y(t)  =  ej8**2  +  eJwt  5  +  77(f),  0  <  t  <  2  (4.1) 


where  77(f)  is  an  additive  white  Gaussian  noise  with  mean  0  and 
variance  cr 2  =  9.  The  window  size  for  the  short-time  Fourier 
transform  is  1/8.  The  following  constant  of  the  SNR  increase  rate  in 
terms  of  the  number  of  points  N  of  the  DFT  is  illustrated  in  Fig.  2: 


SNR tf  /  N 
SNRf  /  A" 

One  can  see  that  for  this  particular  signal, 


(4.2) 


By  comparing  (3.11)  with  (3.12),  it  is  clear  that  the  following 
relationship  between  the  SNR*  /  in  the  joint  TF  domain  of  (3.6)  and 
the  SNR*  in  the  time  domain  of  (3.1)  at  the  sampling  points  m^: 


SNRt/ 

SNR* 


>0  4 

A 


(3.13) 


where  0.5  comes  from  the  SNR  definition  in  (2.3H3.4).  Therefore, 
as  the  window  size  T  is  small  enough 


SNRf/ 

SNR* 


(3.14) 


Notice  that  the  assumption  of  small  enough  window  size  T  is 
equivalent  to  the  assumption  of  fast  enough  sampling  rate  N/T .  The 
derivation  of  (3.14)  implies  the  following  theorem. 


^^-»0.55§,  as  N  -00.  (4.3) 

SNRf  A 

From  Fig.  2,  one  can  also  see  that  the  constants  of  the  SNR  increase 
rate  have  large  variance  when  the  sampling  rate  is  not  large  enough 
but  almost  become  invariant  when  the  sampling  rate  becomes  large. 


V.  Conclusion 

In  this  correspondence,  we  have  quantitatively  analyzed  the  SNR 
increase  rate  in  the  joint  TF  domain  with  the  short-time  Fourier 
transform  over  the  SNR  in  the  time  domain  for  multicomponent 
signals  in  additive  white  noise.  We  have  shown  that  the  rate  of  the 
SNR  increase  is  on  the  order  of  0(iV/A’),  where 
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ABSTRACT.  In  this  article,  we  construct  two-dimensional  continuous/smooth  local  sinusoidal  bases 
( also  called  Malvar  wavelets)  defined  on  L- shaped  regions.  With  this  construction,  one  is  able  to  construct 
local  sinusoidal  bases  and  lapped  orthogonal  transforms  (LOT)  on  arbitrarily  shaped  regions.  This  work 
is  motivated  from  and  useful  in  object-based  video  coding,  where  a  segmented  moving  object  may  have 
arbitrary  shape  and  block  transform  coding  of  this  object  is  needed. 


1.  Introduction 

It  is  known  that,  in  block  DCT  transform  coding,  one  first  decomposes  an  image  on  a  rectangular 
region  into  8  x  8  or  16  x  16  blocks  and  then  does  8  x  8  or  16  x  16  DCT  on  each  small  block.  Due  to 
the  truncation  of  an  image  in  the  block  decomposition,  the  blocking  effects  with  block  DCT  degrade 
the  performance  in  decoding  at  a  low  bit  compression  ratio.  To  eliminate  the  blocking  effects,  the 
lapped  orthogonal  transform  (LOT)  has  been  developed  by  Malvar  et  al.  [15,  16],  where  overlaps 
between  adjacent  blocks  in  the  decomposition  are  used.  In  LOT,  a  smooth  transition  of  the  DCTs 
between  blocks  is  performed  and  therefore  the  blocking  effects  can  be  eliminated,  while  LOT  does 
not  increase  the  total  number  of  pixels  in  the  transform  domain.  For  applications  in  image  coding, 
see  also  [1]. 

Coifman  and  Meyer  [8]  generalized  LOT  from  discrete-time  signals  to  continuous-time  wave¬ 
forms  with  a  general  description  for  window  functions.  They  constructed  a  family  of  smooth  local 
sinusoidal  bases,  which  are  also  called  Malvar  wavelets  [18].  For  more  about  local  sinusoidal  bases, 
see  [1, 3, 4, 5,  8, 9, 10, 13, 17, 18, 19, 22, 23].  Further  generalizations  were  made  in  recent  literature, 
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see  for  example  [1,  3, 4,  11, 13,  14, 17,  19,  20,  21,  22,  23],  In  particular,  two  dimensional  nonsep- 
arable  smooth  local  sinusoidal  bases  were  constructed  in  [22]  on  rectangular  regions  and  in  [23]  on 
'hexagons.  Discrete  forms  were  discussed  in  [11, 14,  20, 21].  An  important  requirement  for  LOT  is 
that  the  domain  of  an  image  must  be  rectangular.  This,  however,  may  not  be  true  in  object-based 
video  coding  [2,  6,  7, 12],  In  object-based  video  coding,  one  usually  first  segments  moving  objects 
from  image  frames  and  then  codes  the  motion  vectors  and  the  segmented  moving  objects.  There  are 
two  intuitive  ways  to  code  a  segmented  object.  One  is  to  mask  an  object  using  a  larger  rectangular 
window  that  completely  covers  the  object  and  do  block  DCT/LOT  for  the  image  on  the  masked 
rectangular  region  (see  Fig.  1).  This  way  usually  wastes  bits  of  coding  when  the  shape  of  an  object  is 
not  regular  due  to  the  inclusion  of  redundant  areas  (see  Fig.  1).  The  other  is  to  code  the  boundary  of 
an  object  and  code  the  content  inside  the  boundary  separately.  Although  this  way  does  not  include 
redundant  area,  it  does  require  block  DCT/LOT  to  be  applicable  to  images  defined  on  an  arbitrarily 
shaped  region. 


FIGURE  1 .  Segmented  moving  object  and  rectangular  mask. 


Another  way  to  code  a  segmented  moving  object  is  between  the  above  two  ways,  which  masks 
the  object  by  using  small  rectangular  blocks  [see  Fig.  2(a)].  With  this  masking  method,  the  redundant 
area  is  clearly  smaller  than  the  one  in  Fig.  1.  The  question  now  is  whether  LOT  can  be  implemented 
for  the  rectangular  blocks  in  Fig.  2(a).  To  study  this  question,  we  decompose  the  mask  in  Fig.  2(a) 
into  two  parts:  a  rectangularly  shaped  region  part  as  shown  in  Fig.  2(b)  and  a  nonrectangularly  shaped 
boundary  region  part  as  shown  in  Fig.  2(c).  For  the  rectangularly  shaped  region  part,  the  standard 
constructions  apply.  Thus,  the  question  is  reduced  into  whether  LOT  can  be  implemented  for  the 
domain  shown  in  Fig.  2(c).  We  call  the  regions  with  the  shapes  in  Fig.  2(c)  L -shaped  regions .  As 
long  as  LOT  is  implementable  on  L-shaped  regions  in  Fig.  2(c),  LOT  is  implementable  on  all  masks 
shown  in  Fig.  2(a)  which  cover  arbitrarily  shaped  regions.  Based  on  this  observation,  in  the  rest  of 
this  article  we  focus  on  local  sinusoidal  bases/Malvar  wavelets/LOT  on  L-shaped  regions  shown  in 
Fig.  2(c). 

This  article  is  organized  as  follows.  In  Section  2,  we  construct  continuous/smooth  local  sinu¬ 
soidal  bases  on  L-shaped  regions.  In  particular,  we  present  a  set  of  conditions  on  two-dimensional 
window  functions  for  two-dimensional  continuous/smooth  local  sinusoidal  bases  on  L-shaped  re¬ 
gions.  The  conditions  are  different  but  similar  to  those  in  [8,  22, 23].  In  Section  3,  we  present  some 
numerical  examples  of  both  window  and  local  sinusoidal  bases  on  L-shaped  regions.  In  Section  4, 
we  briefly  mention  the  construction  of  LOT  on  L-shaped  regions,  i.e.,  the  discrete  version  of  local 
sinusoidal  bases  on  L-shaped  regions.  We  also  present  an  application  of  LOT  on  L-shaped  regions 
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rectangular  regions  L-shaped  regions 

FIGURE  2.  Segmented  moving  object  and  nonrectangular  mask. 

in  image  compression.  In  Section  5,  we  conclude  by  discussing  local  sinusoidal  bases  on  mixed 
regions  of  L-shaped  and  rectangular  regions. 


2.  A  Theory  for  Local  Sinusoidal  Bases  on  L-Shaped  Regions 


In  this  section,  we  build  a  general  theory  for  the  construction  of  local  sinusoidal  bases  on 
L-shaped  regions.  We  first  describe  the  problem  precisely. 

An  L-shaped  region  A  to  work  on  in  the  following  is  shown  in  Fig.  3(a).  The  region  A  consists 
of  three  small  rectangular  regions:  Ai,  A2,  and  A3  as  shown  in  Fig.  3(a).  In  what  follows,  small 
bold  English  letters,  such  as  x  =  (jcj,  X2),  y  =  (y\ ,  y2>  for  real  */,  y/,  l  =  1, 2,  always  denote  two- 
dimensional  vectors  in  R2.  The  goal  of  this  article  is  to  build  smooth  local  sinusoidal  bases/Malvar 
wavelets/LOT  defined  on  the  L-shaped  region  A  from  sinusoidal  bases  on  Aj. 

Let  /;,*(x),  k  6  Z,  be  an  orthonormal  basis  defined  on  Aj  for  the  signal  space  L2(A;), 
j  =  1,  2,  3,  where  L2(B)  denotes  all  square  integrable  functions  on  the  region  B,  A  trivial  method 
for  forming  an  orthonormal  basis  for  L2(A)  is  simply  to  use  the  truncation  window  xa7  (x),  1  for 
x  €  Aj  and  0  otherwise,  and  form  //,*(x)xa,(x),  j  =  2,  3,  k  €  Z,  x  e  A.  This  is  equivalent  to 

using  block  DCT  when  fjx  are  products  of  two  cosine  functions  with  discrete  forms.  Clearly,  the 
basis  elements  fj^  (x)xa,  00  may  have  discontinuities  which  may  cause  blocking  effects  as  discussed 
in  the  Introduction.  The  purpose  of  the  rest  of  this  article  is  to  construct  continuous/smooth  basis  for 
L2(A)  from  continuous/smooth  local  bases  fj defined  on  Aj,j  =  1 , 2,  3.  The  basic  idea  to  achieve 
the  goal  is  similar  to  previous  work:  include  overlaps  between  these  three  rectangular  regions  Aj 
and  replace  the  truncation  window  xa7  with  some  better  designed  windows  Wj.  Before  going  to  the 
details,  we  define  some  notations. 
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A 


A 


(a) 


FIGURE  3.  An  L-shaped  region  A. 


(b) 


2.1  Notations 

We  divide  the  L-shaped  region  A  in  Fig.  3(a)  as  follows  [see  Fig  3(b)]: 

Ai  =  {x  =  (xi,  *2)  •  ao  <  x\  <  a\,  bo  <  xi  <b\) 

A2  =  {x  =  (xi,  X2) :  a~\  <  x\  <  oq,  bo  <  X2  <  b\) 

A3  =  [x  =  (xi,X2):  a-i  <  xi  <  ao,  h-i  <  X2  <  ho) . 

where  «o  =  bo  —  0,  a_  1,  b~\  <  0,  and  a\,  b\  >  0. 

The  overlaps  between  the  three  rectangular  regions  Ay  are  shown  in  Fig.  3(b),  which  are 
bounded  by  the  dotted  lines.  The  symbols  e  and  S  denote  the  single  width  of  the  overlaps  between 
Ai  and  A2  in  the  xi  direction,  and  between  A2  and  A3  in  the  x2  direction,  respectively,  shown  in 
Fig.  3(b).  The  overlaps  consist  of  four  nonoverlapped  regions  Bi,  l  =  1,2,  3, 4,  shown  in  Fig.  3(b): 

B\  =  {x  =  (xi,X2):  -e  <xi  <ao,  S  <X2<bi) 

U{x  =  (x  1 ,  X2)  :  — e  <  xi  <  ao,  —Sx\/€  <  X2  <  5} 

B2  =  {x  =  (xi, X2) :  (-xi,X2)gBi) 

B3  =  {x  =  (xi,X2) :  a-i  <  xi  < -€,  ho  <  X2  <  5} 

U{x  =  (xj,  X2) :  —  e  <  x\  <  ao,  bo  <  X2  <  Sx \/e) 

B4  =  {x  =  (xi,  X2) :  (xi,  -X2)  €  S3} 

With  these  overlaps,  the  extended  regions  of  Ay,  denoted  by  Ay  shown  in  Fig.  4,  j  =  1, 2, 3,  are: 
A\  =  B\  U  Ai,  A2  =  B2  U  S4  U  A2,  and  A3  =  S3  U  A3. 

2.2  Theory  for  Construction 

In  order  to  construct  continuous/smooth  orthonormal  bases  for  L2(A)  from  local  orthonormal 
bases  /y,*(x),  k  e  Z,  for  L2(Ay),  j  =  1, 2, 3,  first  we  need  to  extend  /y,*(x)  from  the  rectangular 
regions  Ay  to  the  L-shaped  region  A  and  then  construct  window  functions  Wj  to  window  the  extended 
local  bases.  We  denote  the  extended  bases  of  /y.t  as  //,*,  where  odd  and  even  extensions  are  also 
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^3 


FIGURE  4.  Extended  regions  of  Aj . 


used  as  follows. 


A*(x) 


fi.k  00 


/3.t(x) 


/i.*(x),  x  e  Ai, 

x  =  (*1, x2)  e  B\, 

0,  x  e  A  butx  i  Ai  , 

/2,*(x),  x  €  A2, 

X  =  (xi,x2)  e  £2, 

— /i.t  (xj ,  “X2)  1  x  —  (x] ,  X2)  €  f?4, 
0,  x  e  A  but  x  £  A2  . 

/3,*(x),  x  e  A3, 

/3.*  (*1 .  -X2) .  x  =  (x  1 ,  x2)  e  B3, 

0,  x  €  A  but  x  £  A3  , 


(2.1) 


(2.2) 


(2.3) 


where  k  6  Z.  Clearly,  fj,k(x).  k  6  Z  are  supported  on  the  extended  region  Aj  of  Aj,  j  =  1,2,3. 
The  conditions  on  window  functions  Wj  are  the  following. 


(a)  Wi (x)  =  1  forx  €  Ai  butx  g  B2, 

W2(x)  =  1  for  x  €  A2  but  x  $  B\  U  B3, 

W3(x)  =  1  for  x  e  A3  but  x  £  £4. 

(b)  Wj  (x)  =  0  forx  €  A  butx  £  Aj  for  j  =  1,2, 3. 

(c)  Wi(*i,X2)  =  W2(—xi,x2)  forx  =  (xi,x2)  e  fii  Ufl2, 
W2(*i .  *2)  =  W3(xi,  -x2)  for  x  =  (*i ,  x2)  e  fi3  U  £4. 

(d)  tV,2(x)  +  W22(x)  =  1  forx  €  Si  U  B2, 
w£(x)  +  IV2 (x)  =  1  for  x  e  £3  U  B4. 

The  support  of  Wj  is  also  Aj  for  j  =  1, 2, 3. 
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Finally,  we  form 

Uj,*(x)  =  fj,k(x)Wj(x)>  X  6  A,  j  =  1, 2, 3,  k  e  Z  ,  (2.4)  « 

and  have  the  following  theorem. 

Theorem  L 

The  functions  Uj^{x)for  j  =  1,  2, 3,  k  6  Z,  andx  6  A,  form  an  orthonormal  basis  for  L?(A) 
when  fM(x),  k  e  Z,  x  e  Aj,form  an  orthonormal  basis  for  L2(Aj)f  j  =  1, 2, 3. 

Proof.  The  proofs  of  the  orthogonality  and  the  completeness  of  Uj are  similar  to  the  ones  for 
Malvar  wavelets  on  rectangular  regions  studied  in  [22].  The  details  are  omitted  here.  D 

Remark.  The  odd  and  even  extensions  in  (2.1)  through  (2.3)  are  also  called  the  folding  processes 

in  [1,  3, 4,  13,  17,  19].  The  window  functions  in  (a)  through  (d)  also  can  be  characterized  similar  to  * 

the  one-dimensional  case  studied  in  [4].  □ 

< 

3.  A  Family  of  Continuous/Smooth  Local  Sinusoidal  Bases  on 

L-Shaped  Regions 


With  the  general  theory  in  Section  2.2  we  want  to  construct  a  family  of  continuous/smooth 
local  sinusoidal  bases  on  L-shaped  regions.  The  local  bases  //,*  are  separable  sine  bases  as  follows. 

si„  (k,  +  i)  ,  (3.1) 

-  Wo = J)(t|  . (*  (*'  +  0 
si"(,'(t!  +  0^)  '  <3  2) 
=  V(o.  -  a-,')  W  -  gf”  ("  (*' +  0 

<3-3) 

where  k\ ,  kj  =0, 1,2,  — 

Next,  we  construct  window  functions  Wj  (x) .  The  idea  for  the  construction  is  the  following. 
We  draw  L-shaped  lines  in  an  L-shaped  region  A,  which  are  parallel  to  the  boundary  of  A  (see 
Fig.  5).  We  treat  these  lines  as  one-dimensional  domains  where  one-dimensional  window  functions 
are  defined.  The  overlaps  for  these  one-dimensional  window  functions  are  the  intervals  bounded  by 
the  dots,  i.e.,  the  intersections  of  the  L-shaped  lines  with  regions  B\  U  Bi  and  £3  U  £4.  There  are 
two  kinds  of  such  L-shaped  lines.  For  the  first  one,  the  overlaps  are  separated  and  for  the  second 
one,  the  overlaps  are  adjacent  but  do  not  intersect  except  at  the  boundaries  (see  Fig.  5).  Notice  that 
these  lines  are  not  closed,  which  is  not  like  the  case  for  hexagons  studied  in  [23]. 

With  the  above  idea,  the  following  construction  for  window  functions  Wj  follows. 

Step  1.  1-regions. 


< 


« 


c 


•* 


Wi(x) 


1  forx  €  Ai  butx  $  £2  , 
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A 


FIGURE  5.  Window  functions  constructed  from  one-dimensional  window  functions. 


W2(x)  =  1  for  x  €  A2  but  x  £  B\  U  £3  , 
Wi(x)  =  1  for  x  e  A3  but  x  £  B4  . 


Step  2.  0-regions. 

Wj(x)  =  0  for  x  €  A  but  x  £  Ay,  j  =  1,  2,  3  . 


Step  3.  Overlapped  regions. 
On  the  region  Bi  U  £2, 

Wi(x)  =  Wi(xi,x2)  « 
W2(x)  =  W2(jd,jc2)  - 
On  the  region  S3  U  £4. 

W2(x)  =  W2(xlfjc2)  = 
W3(x)  as  W3(XI,X2)  = 


sin  (£{*1  -ao+e}). 
sin(^{*l  -ao  +  f  e})  . 
cos(£{*i  -a0  +  e}). 

cos  (43^*1  ~ao  +  ^f))  - 

sin  (33  {x2~bo  +  5}), 
sin  (illf  {-*2  -  bo  +  . 

CO®  (Jj  {*2 -i>0  +  <$}), 
C0S(^lf  {^2  -bo  +  ^5}). 


\x\\<e,&<X2<b\, 

l*il  <  %*,bo  <  *2  <  <$, 

1*1 1  <  e,8  <  X2  <  b\ . 
1*1 1  5  <X2  <8 


l*2l  <  8,  a- 1  <  *1  <  -e, 
l*2l  <  ^8,  -€  <  x\  <  ao  , 

l*2l  <S,a_i  <*1  < 
l*2l  <  -r1^.  — €  <  *1  <  ao  • 


For  general  one-dimensional  window  functions  W/(*),  the  construction  of  Wj  (x)  on  the  regions 
B\  U  f?2  and  B3  U  £4  can  be  obtained  by  replacing  the  above  sin  and  cos  with  Wj  properly.  Notice  that 
the  above  Wj  (x)  are  continuous  everywhere  else  inside  A  but  at  the  origin  and  are  basically  generated 
from  one-dimensional  window  functions  for  smooth  local  sinusoidal  bases.  With  the  construction 
of  smooth  one-dimensional  window  functions  Wj(x),  it  is  possible  to  construct  smooth  Wj (x)  in 
the  sense  of  continuous  8*1+*2  W/(xj,  *2)/3*f1  dx*2  for  some  nonnegative  integers  ki  and  kj  when 
(*1  >  *2)  7 t  (0,0). 


60 


Xiang-Gen  Xia 


Extend  the  local  bases  fj,k\te  in  (3.1)  through  (3.3)  from  the  domains  Aj  to  A  according  to 
the  extension  method  in  (2.1)  through  (2.3),  which  are  denoted  by  fj,k\,k2'  Then,  form  W/\*lt*2(x)  = 

fjMM(x)Wj(x)  for  j  =  1, 2, 3  and  ku  k2  =  0, 1,  2 . Since  functions  //,*,,*2(x)  are  zero  at  the 

origin,  the  windowed  basis  elements  Uj^x ,k2 (x)  are  continuous  in  the  L-shaped  region  A.  The  above 
window  functions  Wj  clearly  satisfy  conditions  (a)  through  (d).  This  proves  the  following  result. 

Theorem  2. 

The  above  constructed  functions  Uj^uk2  (x)>  j  =  1, 2, 3  andk\,ki  —  0, 1, 2, . . are  continu¬ 
ous  and  form  an  orthonormal  basis  for  L2(A). 

Next  we  want  to  see  some  numerical  examples.  Let  a-\  =  —2,  ao  =  0,  a  i  =  1,  h-i  =  —3, 
bo  =  0,  b i  =  2,  €  —  0.5,  and  5=1.  Figures  6  through  8  show  the  window  functions  Wj(x)  for 
j  =  1,2,  3,  respectively.  Figures  9  through  11  show  the  basis  elements  Ujt i,i(x)  for  j  =  1,  2,  3, 
respectively. 


4.  LOT  on  L-Shaped  Regions  and  Application  in  Image 

Compression 

In  this  section,  we  want  to  briefly  introduce  the  construction  of  LOT  on  L-shaped  regions. 
Then,  we  show  a  numerical  example  that  shows  that  the  SNR  of  the  LOT  on  L-shaped  regions 
performs  better  than  the  one  of  the  LOT  on  rectangular  regions  or  the  block  DCT. 

The  main  difference  between  discrete-time  and  continuous-time  local  sinusoidal  bases  is  that 
the  variables  x\  and  X2  are  integers  and  the  overlap  sizes  e  and  S  are  also  integers.  As  an  example 
of  constructions  of  discrete-time  local  sinusoidal  bases  or  LOT  on  L-shaped  regions,  we  consider 
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FIGURE  12.  Discrete  L-shaped  region  and  the  overlaps. 
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FIGURE  13.  SNR  performance  comparison. 


FIGURE  14.  Mixed  regions  of  L-shaped  and  rectangular  regions. 
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three  8x8  blocks  shown  in  Fig.  12.  The  overlap  sizes  are  €  =  8  =  4.  Notice  that,  unlike  the 
continuous-time  case,  the  intersection  set  between  B\  U  Bi  and  £3  U  B4  in  the  discrete-time  case  is 
empty.  The  overlaped  regions  are  shown  in  Fig.  12,  too.  The  rest  is  similar  to  the  continuous-time 
case  by  replacing  the  continuous  variables  xt  =  n,*  +  1/2.  For  more  details,  see  [20,  21,  22]. 

We  have  implemented  a  numerical  example  on  applications  of  LOT  on  L-shaped  regions  in 
image  compression.  The  test  image  has  size  96  x  96  that  is  chosen  for  the  convenience  of  the 
blocking.  The  signal-to-noise  (SNR)  ratio  curves  are  shown  in  Fig.  13.  One  can  clearly  see  the 
improvement  of  the  LOT  on  L-shaped  regions  over  the  LOT  on  rectangular  regions. 


5.  Conclusions 

In  this  article,  we  have  constructed  continuous/smooth  local  sinusoidal  bases/Malvar  wavelets 
on  L-shaped  regions,  which  is  motivated  from  object-based  video  coding.  It  is  not  hard  to  generalize 
the  construction  for  mixed  regions  of  rectangular  and  L-shaped  regions,  such  as  Fig.  14  with  solid 
lines.  An  important  point  for  the  construction  is  the  design  of  overlaps.  For  the  region  shown  in 
Fig.  14,  an  overlap  design  is  also  shown  with  dotted  lines.  In  this  article,  we  use  the  construction 
for  L-shaped  regions  and  the  separable  construction  or  nonseparable  construction  studied  in  [22]  for 
rectangular  regions.  Notice  that  the  construction  of  continuous/smooth  local  sinusoidal  bases  at  the 
most  outside  boundaries  does  not  include  any  overlaps.  This  approach  has  been  recently  used  in  [24] 
for  image  compression. 
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Abstract — In  conventional  wavelet  transforms,  prefiltering  is 
not  necessary  due  to  the  lowpass  property  of  a  scaling  function. 
This  is  no  longer  true  for  multiwavelet  transforms.  A  few  research 
papers  on  the  design  of  prefilters  have  appeared  recently,  but 
the  existing  prefilters  are  usually  not  orthogonal,  which  often 
causes  problems  in  coding.  Moreover,  the  condition  on  the  pre¬ 
filters  was  imposed  based  on  the  first-step  discrete  multiwavelet 
decomposition.  In  this  paper,  we  propose  a  new  prefilter  design 
that  combines  the  ideas  of  the  conventional  wavelet  transforms 
and  multiwavelet  transforms.  The  prefilters  are  orthogonal  but 
nonmaxi  nially  decimated.  They  are  derived  from  a  very  natural 
calculation  of  multiwavelet  transform  coefficients.  In  this  new 
prefilter  design,  multiple  step  discrete  multiwavelet  decomposi¬ 
tion  is  taken  into  account  Our  numerical  examples  (by  taking 
care  of  the  redundant  prefiltering)  indicate  that  the  energy  com¬ 
paction  ratio  with  the  Geronimo-Hardin-Massopust  2  wavelet 
transform  and  our  new  prefiltering  is  better  than  the  one  with 
Daubechies  DA  wavelet  transform. 


I.  Introduction 

OW  THAT  single  wavelet  transforms  are  well- 
understood,  multi  wavelets  recently  have  attracted  much 
attention  in  the  research  community;  see,  for  example, 
[l]-[20],  [26]-[32],  where  several  wavelet  functions 

and  scaling  functions  are  used  to  expand  a  signal.  The 
multiwavelet  functions  constructed  by  Geronimo  et  ai  [2]-[4] 
have  more  desired  properties  than  any  single  wavelet  function, 
such  as  short  support,  symmetry,  and  smoothness.  Although, 
in  theory,  they  look  more  attractive  than  single  wavelets, 
not  much  more  advantages  in  practical  applications  over 
single  wavelets  have  been  found  so  far.  In  this  author’s 
opinion,  the  main  reason  behind  this  fact  might  be  because  of 
their  improper  discrete  implementations.  For  single  wavelet 
transforms,  the  discrete  implementation  automatically  follows 
from  their  multiresolution  structure,  i.e.,  tree-structured  two- 
channel  filterbanks.  In  the  tree-structured  filterbank,  lowpass 
and  highpass  filters  are  explicitly  used,  which  is  tight  with 
the  lowpass  and  the  bandpass  properties  of  the  scaling  and 
wavelet  functions,  respectively.  Although,  for  multiwavelet 
transforms,  the  discrete  implementation  also  follows  from 
their  multiresolution  structure,  the  tree-structured  filterbank 
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becomes  a  tree-structured  vector  filterbank  [1],  [8]  (or  time- 
variant  filterbank  [13]).  For  a  tree-structured  vector  filterbank, 
the  lowpass  and  the  highpass  properties  for  the  two  vector 
filters  are  not  as  clear  as  those  for  the  two  filters  in  single 
wavelet  transforms.  It  has  been  found  in  [1],  [16]— [17]  that 
in  order  to  have  a  reasonable  decomposition  for  discrete 
multiwavelet  transforms,  prefiltering  is  necessary.  A  prefilter 
design  method  was  introduced  in  [1],  [16]— [17],  where  the  idea 
is  based  on  the  computability  of  the  multiwavelet  transform 
coefficients  from  uniformly  sampled  signals.  Moreover,  an 
interpretation  of  the  “lowpass”  and  “highpass”  properties  for 
vector  filters  was  introduced  in  [1]  for  the  prefilter  design 
criterion.  The  criterion  is,  however,  only  good  for  the  first  step 
discrete  multiwavelet  transform  decomposition.  The  prefilters 
designed  with  this  method  may  be  nonorthogonal,  which 
might  kill  the  gain  of  the  energy  compaction  in  the  transform 
domain  after  the  decoding  is  performed.  In  [31],  a  different 
approach  was  proposed  for  perserving  the  orthogonality  by 
using  the  approximation  order  criterion.  In  [32],  balanced 
multiwavelets  were  studied,  where  prefiltering  for  these  kinds 
of  multiwavelets  is  not  necessary,  but  other  properties,  such 
as  the  short  supportness  and  the  smoothness,  are  not  as  good 
as  the  GHM  multiwavelets.  Notice  that  in  [1]  and  [8],  it  was 
also  mentioned  that  when  the  “lowpass”  filter  H(u>)  satisfies 
H(0)  =  /,  prefiltering  is  not  necessary. 

In  this  paper,  we  introduce  a  new  prefilter  design  by 
combining  ideas  in  single  wavelet  transforms  and  multiwavelet 
transforms  as  follows.  We  first  construct  a  function  <p(t)  with 
the  lowpass  property,  i.e.,  its  Fourier  transform  <j>(u>)  is  1 
at  u;  =  0,  or  <£(0)  =  1,  from  the  multiscaling  functions 
and  their  translations  such  that  <f>(t  -  n),  n  G  Z  form  an 
orthonormal  set.  Notice  that  the  function  (f>  does  not  have  to 
be  a  scaling  function  since  the  nested  property  is  not  required, 
i.e.,  a  dilation  equation  may  not  be  satisfied.  Due  to  the 
lowpass  property,  a  signal  f(t)  can  be  well  approximated  by 
a  linear  combination  of  2J/2<f>(2Jt  —  n),  n  €  Z  for  a  large  J; 
meanwhile,  f(t)  can  also  be  well  approximated  by  a  linear 
combination  of  the  multiscaling  functions  and  their  trans¬ 
lations  due  to  their  multiresolution  approximation  property. 
Because  of  the  lowpass  property  of  <j>  and  the  orthogonality 
of  <f>(t  -  n),  the  coefficients  in  the  linear  combination  of 
2Jf2<j)(2Jt  -  n),  n  €  Z  are  proportational  to  /(n/2J);  see, 
for  example,  [23]— [25],  and  [35].  The  conversion  between 
these  two  approximations  naturally  suggests  a  prefiltering 
for  computing  the  multiwavelet  transform  coefficients  at  the 
highest  resolution  (or  called  approximation  coefficients)  from 
the  samples  f(n/2J)  of  the  signal  /.  Then,  the  rest  of 
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the  multiwavelet  transform  coefficients  (the  lowest  resolution 
coefficients  and  the  detailed  coefficients)  follows  from  a  tree- 
stmctured  vector  filterbank  [1],  [8].  We  will  see  later  that 
the  lowpass  condition  imposed  on  the  function  O  is  strongly 
related  to  the  lowpass  condition  imposed  on  the  combined 
filters  of  the  prefilters  and  the  multiscaling  functions,  which 
also  relates  to  the  one  imposed  on  the  combined  filters  of 
the  prefilters  and  the  cascaded  vector  filterbanks,  i.e.,  multiple 
steps  of  the  discrete  multiwavelet  transform  decompositions. 
Notice  that  the  above  prefilter  structure  was  first  used  in 
[30],  but  neither  the  lowpass  condition  on  the  function  4>  nor 
any  rationale  for  introducing  such  <f>  was  mentioned.  Instead, 
in  [30],  signal-dependent  optimal  prefilters,  in  terms  of  the 
energy  compaction  criterion,  were  designed.  The  drawbacks 
are  1)  that  the  computational  load  is  high  and  2)  the  signal 
dependency.  In  this  paper,  we  systematically  study  the  prefilter 
structure  and  its  rationale.  The  prefilters  are  signal  independent 
and  orthogonal,  and  they  only  depend  on  multiwavelets. 


Let  /  e  Vj;  then 

/(f)  =  W 

/=1  z 

N 

Cl'JotfrjoA*) 

i= i  fee z 

+  ^  dijikilHjlk(t)  (2-7) 

/=i  j0<j<J  fee z 

where  Jo  <  J,  and 

cij,k  =  J  f(t)4>uAt)dt 

and 

du,k  =  J  fittyijAtydt- 

Let 


II.  Approximation  of  Lowpass  Functions  Using 
Multiscaling  Functions  and  New  Prefilter  Structure 

In  this  section,  we  want  to  motivate  a  new  prefiltering  for 
multiwavelet  transform  coefficient  computation  by  approxi¬ 
mating  a  lowpass  function  using  multiscaling  functions.  To 
do  so.  let  us  first  briefly  review  multiwavelets  and  matrix 
dilation  equations.  For  more  details  about  multiwavelets,  see, 
for  example,  [  1  ]— [20]  and  [26]— [32]. 

Consider  .V  compactly  supported  scaling  functions 

l  —  i,2 . A'  and  their  corresponding  N  mother  wavelet 

functions  I  =  1.2 where  all  the  translations 

4>i(t  -  k).  k  €  Z.  /  =  1,2, ...,N  are  mutually  orthogonal, 
and  tln.j.k  =  2'/2v>/(2 H  -  k),  j,k  e  Z,  /  =  1,2,..., AT  form 
an  orthonormal  basis  for  L2( R).  Let  H(w)  and  G(u)  be  their 
corresponding  A  x  A'  matrix  quadrature  mirror  filters  with 
i\  x  A’  impulse  response  constant  matrices  Hk  and  Gk,  k  €  Z, 
respectively.  Let 

<h(t)  =  (<M0 . <t>s(t))T.  #(*)  =  (i/’i(f),  •  •  • ,  i/’iv(f))T- 

Then,  we  have  the  following  matrix  dilation  equations. 

$(/)  =  2^2  Hk9{2t-k)  (2.1) 

fe 

H>(t)  =  2  ^2  Gk${2t  -  k).  (2.2) 

k 

The  orthogonality  implies 

H(w)Ht(u;)  +  H(w  +  +  v)  =  In  (2.3) 

.  G(w)G\u>)  +  G(ut  +  7r)G^(w  +  7r)  =  In  (2.4) 

H(w)G^(u>)  +  H(ut  +  7r)G^(a>  +  it)  =  On  (2.5) 

where  f  means  the  complex  conjugate  transpose,  and  In 
and  0^  denote  the  N  x  N  identity  and  the  all-zero  matrix, 
respectively. 

For  each  fixed  j  €  Z,  let  Vj  be  the  closure  of  the  linear  span 
of  =  2i/2<pi{2jt  -  k),  l  =  1, 2, . . . ,  N,  k  e  Z.  Then,  the 
spaces  Vj,  j  €  Z  form  an  orthogonal  multiresolution  analysis 
for  L2(R). 


c j,k  =  (ClJ,k,  ■  ■  • ,  CN,j,k)T 

and 

dj,k  =  (dltj,k:  ■■■,  dN,j,k)T- 
Then,  by  the  matrix  dilations  (2.1)-<2.2) 

cJ-1,fc  =  %/2X^cj,2fc+n  (2.8) 

n 

dj-ifjb  =  ^E^c,2fe+n  (2.9) 

n 

and 

cj,n  =  -1,2  fe+n  +  Gfcdj-i,2fc+n)‘  (2.10) 

fe 

Thus,  to  determine  the  multiwavelet  transform  coefficients 
Cj0ifc  and  dj  k  for  J0  <  j  <  J,  k  €  Z  from  /,  it  is  good 
enough  to  determine  the  coefficients  c jyk  for  k  €  Z  from  /. 

Unlike  single  wavelets,  where  cjyk  is  proportional  to  the 
samples  f(k/2J)  when  J  is  large  enough  due  to  the  lowpass 
property  of  a  single  scaling  function,  the  determination  of  cj,fc 
for  multi  wavelet  transforms  from  the  samples  of  f(t)  is  not 
trivial.  When  the  multiscaling  functions  have  the  interpolating 
property,  the  determination  was  given  in  [1]  and  [16]— [17]. 
Furthermore,  a  necessary  and  sufficient  condition  for  the 
solvability  of  cJyk  from  the  samples  of  /  was  also  given 
in  [1].  The  relationship  between  the  samples  of  /  and  the 
coefficients  c Jik  automatically  provides  a  prefiltering  for  the 
multi  wavelet  transform  computation  from  the  samples  of  /. 
For  more  details,  see  [1].  Unfortunately,  the  prefiltering  based 
on  this  relationship  is  usually  not  orthogonal,  which  seems  to 
limit  the  gain  in  the  compression  applications. 

In  order  to  present  our  new  prefilter  design  method,  i.e.,  a 
new  relationship  between  the  samples  of  /  and  c  j,k,  let  us  look 
at  the  conventional  wavelet  transform  coefficient  computation, 
which  is  usually  referred  as  the  Mallat  algorithm. 

Let  <j>(t)  be  a  single  orthogonal  scaling  function.  Then,  for 
any  signal  f(t ),  there  exists  J  >  0  such  that  f(t)  can  be  well 
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approximated  by  </>j,k(t)  =  2J/20(2 Jt  -  k),  k  e  Z,  i.e., 

k 


where 

cj,k  =  J  mtjAi )  dt  (2-12) 

The  relationship  oc  in  the  above  formula  is  because  of  the 
lowpass  property  of  0(f),  i.e.,  0(0)  =  1,  see,  for  example, 
[23]— [25]  and  [35].  The  rest  of  wavelet  transform  coefficients 
can  be  calculated  recursively  from  cj,*.  The  key  point  for  the 
validation  of  (2.1 1)— (2.12)  is  that  the  scaling  function  0(f)  has 
the  lowpass  property,  and  0(f  -  fc),  k  G  Z  are  orthogonal. 

Motivated  from  the  above  observation,  we  now  want  to 
construct  a  function  0(f)  from  the  multiscaling  functions  0/(f), 
l  =  1, 2, . . . ,  N  such  that  0(f)  has  the  lowpass  property,  and 
its  translations  0(f  -  k),  k  G  Z  are  orthogonal  to  each  other. 
Notice  that  such  0(f)  may  not  be  a  scaling  function  because 
it  may  not  satisfy  any  dilation  equation.  As  long  as  0(f) 
has  the  lowpass  property  and  the  orthogonality,  the  properties 
(2.1 1 H2. 12)  hold  for  a  signal  /. 

Let 

N 

4>{t)  =  -  n)  (2.13) 

1=1  n 

where  a/[n]  are  real  constants.  Then 

N 

0(w)  =  ^2  A/(u;)0/(u/)  (2.14) 

/=i 

where 

=  ^2  a/[n]e“jnu;.  (2.15) 

n 

The  lowpass  property  implies 
N 

4>(0  )  =  £a,(0)M0)  =  1.  (2.16) 

1=1 

The  orthogonality  of  0(f  —  n),  n  G  Z  is  equivalent  to 


+  2xn)|2  =  1. 


(2.17) 


Write  out  the  right-hand  side  of  (2.17)  as 
^2  \4>{u  +  2;rn)|2 

n 

=  y  f  ]l  Ai'  (w + 2,rn)) 

n  \/,=l  / 

x  ^  53  A*i  (w + 2nn)  j 


\h 
N  N 


=  53  53  ^  ^Ah  (w)  53 4>h  + 27rn)- 

i,  =  l/2=l  n 


By  the  orthognality  of  <f>i(t  —  n),  l  =  1, 2, . . . ,  N,  n  G  Z,  it 
is  not  hard  to  see  that 

y,  4>h  +  2irn)$i3 ( u>  +  2irn)  =  6(1 1  -  /2)- 

n 


Therefore 

N 

53  l<Hw  +  27rn)|2  =  y  |4((u;)|2- 

n  1=1 

This  implies  that  the  orthogonality  of  <j>(t  -  n),  n  €  Z  is 
equivalent  to 

N 

yi^M|2  =  l-  (2-18) 

1=1 

In  conclusion,  we  have  proved  the  following  lemma. 

Lemma  1:  A  linear  combination  0(f)  in  (2.13)  of  multi¬ 
scaling  functions  0/(f)  and  their  translations  has  the  lowpass 
property  and  the  orthogonality  of  its  translations  if  and  only 
if  the  properties  (2.16)  and  (2.18)  hold. 

We  now  assume  0(f)  in  (2.13)  satisfies  the  lowpass  property 
(2.16)  and  the  orthogonality  (2.18).  For  a  given  signal  /(f), 
by  the  lowpass  property  of  0(f),  there  exists  a  J  >  0  such 
that  (see,  for  example,  [35,  Prop.  5.3.2,  p.  142]) 

/(<)«^U^(2Jt-n)  (2.19) 

n 

where 

bn  =  J  f(t)2J/2<j>(2Jt  -  n)  dt. 

An  estimate  of  the  difference 

/(*)  -  X>»2J/V(2Jf  -  n ) 

n 

is  given  in  the  Appendix.  Notice  that  the  only  condition  on 
0(f)  for  the  relationship  (2.12)  to  hold  is  the  lowpass  property, 
i.e.,  0(0)  =  1.  Therefore,  similar  to  (2.12),  we  have 

bn  oc  for  large  J. 

Without  loss  of  the  generality,  we  may  assume  J  =  0  for 
simplicity.  Then 

/(«) «  y -  n)  311(1  b* « /(»)• 


From  (2.13) 

/W*£M(  t-n) 

n 

N 

=  53  bn  53  53  ai[m]<j>i(t  -n-m) 

n  /=1  m 

=  53  53  (y  6fc_ma,[m]  J^(f-fc). 

(=1  It  \  m 

This  implies  that 

y  «  c/,o,it) 


l  =  l,2,...,N,keZ  (2.20) 
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Fig.  1.  New  prefiltering:  Decomposition. 


m 


A2  fa) 


^  * 


A> 


Fig.  2.  New  prefiltering:  Reconstruction. 


where  6„  a  f(n).  n  e  Z.  The  above  result  (2.20)  suggests 
the  following  new  relationship,  i.e.,  a  new  prefilter,  between 
the  samples  f(n)  of  /(<)  and  the  multiwavelet  transform 
coefficients  c/.o.t 

ci.o.k  =  ^2  f(k  -  m)ai[m]  (2.21) 

m 

which  is  shown  in  Fig.  1. 

By  the  orthogonalities  of  multiwavelets  (2.3M2.5)  and 
prefilters  (2.18),  the  reconstruction  can  be  shown  in  Fig.  2. 

The  difference  between  the  above  prefilter  bank  and  the 
prefilter  bank  proposed  in  [1]  is  the  following.  The  above 
prefilter  bank  is  not  maximally  decimated,  i.e.,  redundancies 
are  introduced.  Actually,  the  number  of  coefficients  in  the 
transform  domain  is  increased  by  N  times.  The  prefilter  bank 
in  [1]  is,  however,  maximally  decimated,  and  no  redundancy 
is  introduced.  We  might  want  to  ask,  since  we  are  usually 
interested  in  reducing  the  redundancies,  why  we  need  to 
introduce  redundancy  here.  The  answer  here  is  two-fold. 
First,  proper  overcomplete  (or  redundant)  transforms  plus 
vector  quantizations  might  perform  better  than  nonredundant 
transforms.  This  suggests  that  including  redundancy  in  the 
transform  might  not  be  a  bad  idea  due  to  its  better  tolerance  of 
noise  than  nonredundant  transforms.  Second,  from  our  numeri¬ 
cal  examples,  the  energy  compaction  with  this  new  prefiltering 
is  better  than  the  one  with  Daubechies  wavelet  transform 
after  the  nonmaximality  of  the  decimation  in  prefiltering  has 
been  taken  into  account. 


Notice  that  the  energy  of  /(n)  is  preserved  after  the  whole 
discrete  multiwavelet  transform  in  Fig.  1  is  performed  due 
to  the  orthogonalities  of  the  multiwavelet  transform  and  the 
prefilter  bank,  although  the  prefilter  bank  is  nonmaximally 
decimated. 

Motivated  from  the  above  prefiltering  and  the  one  in  [1], 
we  propose  the  following  general  prefiltering  for  discrete 
multiwavelet  transforms,  which  is  shown  in  Fig.  3,  where 
1  <  K  <  N  and  the  pre/post  filterbank  shown  in  Fig.  4 
have  the  perfect  reconstruction  property.  Specifically,  when 
the  filterbank  in  Fig.  4  is  paraunitary,  the  prefiltering  in  Fig.  3 
is  orthogonal. 


m.  Prefilter  Design  and  Examples 

In  this  section,  we  first  study  the  general  N  wavelet  case 
and  then  study  the  case  of  N  =  2.  Finally,  we  look  at  two 
examples.  One  is  the  Geronimo— Hardin— Massopust  2  wavelet 
prefilter  design,  and  the  other  is  the  prefilter  design  for  one  of 
the  2  wavelets  obtained  by  Chui  and  Lian  in  [20]. 

A.  General  N  Wavelet  Prefilter  Design 

Although,  for  the  general  prefilter  bank  in  Fig.  3  (i.e., 
general  K ),  the  interpretation  in  the  previous  section  does  not 
hold,  the  design  of  a  prefilter  bank  Ai{u)  can  be  done  using  the 
same  criterion  given  in  [1],  where  K  =  N.  In  the  following, 
we  focus  on  the  case  of  K  =  1  and  use  the  interpretation  in 
Section  II  to  design  the  prefilter  bank  Ai(u).  Moreover,  we  are 
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deoaipcsitiai 


racxiBtoctiai 

Fig.  3.  General  prefiltering:  Decomposition  and  reconstruction. 


Fig.  4.  Pre/post  filterbank. 


only  interested  in  designing  FIR  prefilter  banks.  The  lowpass  where  ur  =  (uri, . . .  ,uwv)  and  the  norm  of  the  vector  ur 
and  the  orthogonality  conditions  (2.16)  and  (2.18)  will  be  used,  is  1,  i.e.. 

Due  to  its  orthogonality,  any  FIR  prefilter  bank  Ai(oj)  can  N 

be  factorized  as  (see,  for  example,  [33],  [34])  ^  |u  f|2  _  ^ 


where 


/Ai(uj)  \ 

(M0)\ 

1  From  the  matrix  dilation  equation,  we  have 

*  - 

=  VM- 

• 

(3.1) 

1 

(M0)\ 

/MO)  \ 

V4;v(0)  / 

• 

=  H(0) 

\0isr(O)/ 

\0Ar(O)/ 

N 


Ei^(°)i2  =  1 


1=1 


(3.4) 


(3  2)  ^en  H(u>)  is  known,  the  vector  (0(0), . . .  ,4>n(0))  can  be 
solved.  Then,  the  orthogonality  and  the  lowpass  property 
(2.16)  and  (2.18)  are  equivalent  to 


N 


N 


Ur(w)  =  IN  +  (e-*  -  l)u*ur 


(3-3) 


^|^(0)|2  =  1  and  5>(O)0,(O)  =  1.  (3.5) 


1=1 


/  =  1 


and 
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The  only  constraint  for  the  parameters  uri  is  that  they  need 
to  be  of  the  unit  norm  for  r  =  1, 2, . . . ,  p.  The  parameter  p 
determines  the  prefilter  length  and  is  called  the  order  of  the 
prefilter  (v4/(w))I=i,2,...,iv.  When  there  is  no  Ur(u>)  term  in 
(3.1),  we  set  p  =  0,  i.el,  the  order  of  the  prefilter  is  zero. 

Additional  conditions  may  be  imposed  on  the  above  param¬ 
eters.  An  important  one  is  that  the  combined  filters  of  Ai(w) 
and  H(o>)  need  to  be  lowpass  filters,  and  the  combined  filters 
of  Ai(w)  and  G(w)  need  to  be  highpass  filters.  The  reason 
for  this  condition  is  the  same  as  what  was  proposed  in  [1], 
i.e.,  we  need  to  keep  the  “lowpass”  part  and  decompose  it 
again  and  again  but  quantize  the  “highpass”  part  and  therefore 
keep  the  “highpass”  part  as  small  as  possible.  This  means  the 
“highpass”  part  needs  to  be  the  high-frequency  part;  otherwise, 
it  will  have  a  lot  of  energy. 

By  thinking  of  the  multiscaling  vectors  as  the  cascaded 
version  of  the  “lowpass”  vector  filter  H(w),  the  new  lowpass 
property  (2.16)  for  the  function  <j>  means  the  lowpass  property 
for  the  combined  filters  of  the  prefilters  Ai(u)  and  cascaded 
vector  filters  H(w).  Therefore,  the  above  two  lowpass  condi¬ 
tions  [the  new  one  (2.16)  and  the  old  one  in  [1]]  somewhat 
guarantee  the  lowpass  properties  of  the  all-approximation 
multiwavelet  transform  coefficients  Cj,*  -for  Jo  <  j  <  0- 
The  old  lowpass  condition  in  [1]  is  for  the  lowpass  property 
of  the  first  step  decomposition  and  the  new  lowpass 

condition  in  this  paper  is  for  the  follow-up  decompositions 
Cj  k  for  J0  <  j  <  -1-  The  old  lowpass  condition  in  [1]  can 
be  stated  as  follows. 

There  are  N  combined  filters  of  Ak(uj)  and  H(u>)  and  N 
combined  filters  of  Ai(ur)  and  G(w).  They  are 

,v 

H,(u!)  =  Y,  *  =  1.2 N  (3.6) 

k- 1 
and 

.v 

Gi(uj)  =  Y^Gi,k(v)Ak{u),  l  =  1,2, ...,N  (3.7) 

1 


rmHrne  to  ctecrnpose 


qantiae 


Fig,  5.  Combined  filters  of  prefilters  and  multiwavelet  filters. 

B.  Theory  for  2  Wavelets 

Since  there  always  exists  a  solution  for  (3.4),  there  exist 
two  real  constants  a  and  b  such  that 

4i(0)  +  6<A2(0)  =  0.  (3.10) 

Without  loss  of  generality,  we  may  assume  <£i(0)  =  c<fo( 0) 
for  a  real  constant  c.  Then,  by  (3.5) 

cA1(0)  +  A2(0)  =  l/^2(0)  =  x,  or  A2(0)  =  x  -  cAi(O) 

(3.11) 

and 


respectively,  where  H(u;)  =  (Hi^(^))nxN>  and  G(w)  = 
(Gitk(v))sxs-  Then,  die  prefiltering,  the  first  step  multi¬ 
wavelet  transform  decomposition,  and  their  combined  filters 
can  be  shown  in  Fig.  5. 

The  lowpass  property  on  Hi(u)  is 

N 

Y,  H,,k(*)Ak(*)  =  0,  1  =  1,2,  (3.8) 

k=\ 

The  highpass  property  on  Gi(uj)  is 


(1  +  c2)A?( 0)  -  2xcA\ (0)  +  a:2  -  1  =  0,  or 

a  (n\  -  xc  ±  +  c2  ~  — 

Al(°)"  1  +  C2 


where  x  is  an  arbitrary  constant.  This  implies  that  there  always 
exist  solutions  for  (3.5). 

When  matrix  G(0)  has  full  rank,  the  only  solution  for  (3.9) 
is  Ai(0)  =  A2(0)  =  0,  which  does  not  satisfy  (3.5). 

When  matrix  G(0)  does  not  have  full  rank,  there  exist 
solutions  for  At( 0),  l  =  1,2  in  (3.9),  i.e.,  there  exist  two 
real  constants  d  and  e  such  that 


N 

^Gz,fc(0)Afc(0)  =  0,  /  =  1,2, . . . ,  iV.  (3.9) 

k=l 

In  conclusion ,  the  above  four  conditions  [i.e.,  (3.1),  (3.5), 
(3.8),  and  (3.9)]  need  to  be  imposed  on  the  prefilter  design 
given  a  multiwavelet. 


dAi(0)  =  eA2(0).  (3.13) 

Clearly,  there  exists  a  solution  for  Aj(0)  and  A2(0)  in 
(3.11  M3. 13). 

Now,  the  only  condition  left  is  (3.8).  Although  the  existence 
of  the  zeroth-order  prefilter  (Ai(w),  .42(o;))  =  (Ai(0),  A2(0)) 
in  (3.8)  depends  on  the  form  of  (4li(0),^42(0))  and  H(0)  (we 
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will  see  later  that  there  does  not  exist  any  zeroth-order  prefilter  Solving  (3.4)  and  (3.5),  we  have 
that  satisfies  (3.8)  for  the  GHM  2  wavelets,  but  there  does  exist 

for  one  of  the  2  wavelets  obtained  by  Chui  and  Lian  in  [20]),  .  _  x\[2  ±  —  x2  ^  _  x  T  y/3  —  x2 

we  may  analyze  first  order  prefilters.  In  this  case  iw  —  3  1  3 

GmwO  =  (i2  +  ~  1){Ze)icos9Ma6)) 

'  2  '  •  '  '  '  '  where  x  is  a  real  constant  with  |x|  <  \/3- The  condition  (3.9) 

x  (  ]  (3.14)  -  implies 


Ai(u>) 

Ao(&) 


/2  +  (e~Ju;  —  1) 


and 

/%(’>■) 

I'M*) 


■  cos  26/  —  sin 
-  sin  20  cos 


sin 20 \  f  Ai( 
:os2 0  )  \A2( 


^A2{0)  =  4i(0). 


Therefore,  we  solve  for  4/(0)  as 


where  0  is  an  arbitrary  angle. 

For  the  same  reason  as  before,  when  matrix  H(7r)  has  full  _  _  >/3 

rank,  there  are  no  solutions  for  (3.5)  and  (3.8).  Therefore,  we  3  80  2  3 

assume  that  matrix  H(x)  does  not  have  full  rank.  Then,  there 

exist  two  real  constants  it.v  such  that  Then,  the  prefilters  in  (3.1)  can  be  written  as 


uAi(tt)  +  iM2(7t)  =  0. 

By  (3.14) 

(t/.42(0)  -  uAi(O))  cos(20)  =  (u„42(0)  +  vAi(O)) sin(20). 

Thus,  there  exists  an  angle  0  such  that  the  above  equation 
holds.  This  proves  the  following  theorem. 

Theorem  /:  There  exists  a  first-order  prefilter 
(A\(u).  A2(u;))  that  satisfies  all  conditions  [i.e.,  (3.1), 
(3.5),  (3.8),  and  (3.9)]  if  and  only  if  none  of  matrices  H(tt) 
and  G(0)  has  full  rank. 

As  pointed  out  by  one  of  the  referees  of  this  manuscript,  the 
condition  in  the  above  theorem  always  holds  if  a  constant  can 
be  expressed  by  a  linear  combination  of  the  translates  —  k) 
and  4>2(t  -  k)  of  two  scaling  functions  <t>\{t)  and  <£2(f). 


C.  Design  Examples  for  the  GHM  2  Wavelets 

We  first  want  to  see  the  Geronimo-Hardin-Massopust  2 
wavelets  with  the  following  matrix  impulse  responses  of  the 
vector  filters  H(u>)  and  G(u;),  respectively. 

(  3/10  2v/2/5\  (  3/10  0  \ 

0  V-v/2/40  -3/20 )'  1  ^9^2/40  1/2  J 

=  (9V2/W  -3/2o)’  Wa=(-v/2/40  0) 
and 

_  (-V2/40  -3/20  \  _  [9^2/40  — 1/2  \ 

0  ~  V  -1/20  -3V2/20)'  1  _  V  9/20  0  ) 

_  (9V2/40  -3/20  \  r  _  [-V2/40  0\ 

2  y  -9/20  3\/2/20/’  3  ~  V  1/20  °/ 


From  (3.4) 


(3-i7) 

where 

Ur(uj)  =  I2  +  -  1)  )  (ttrl ,  ur2) 

with  v%x  +  u* 2  =  1  for  two  real  constants  ur \  and  ur2.  It 
is  clear  that  (3.8)  implies  that  the  order  p  in  (3.17)  must  be 
greater  than  or  equal  to  1.  Since  matrices  H(x r)  and  G(0) 
do  not  have  full  rank,  by  Theorem  1,  there  exists  a  first-order 
prefilter  satisfying  the  conditions.  Let  us  see  what  it  looks  like. 


where  0  is  an  angle.  Thus 

(Ai(r)\ _  /-cos20  -sin 2('Wv/2\J_ 
(^2(x)J  ~  V-siu2&  cos20,/Vl  )  yfi' 

Therefore,  (3.8)  implies 

I  \Pl 

—  \Fl  sin  20  +  cos  20  =  0,  or  0  =  -  arctan  — .  (3.19) 

z  z 

This  proves  the  following  theorem. 

Theorem  2:  The  prefilter  in  (3.18)  with  the  0  in  (3.19) 
satisfies  all  conditions  we  want,  including  the  following. 

1)  the  lowpass  property  of  <f>(i),  i.e.,  4>{ 0)  =  1; 

2)  the  orthogonality  of  <j>{t  -  n),  n  G  Z  and  the  orthogo¬ 
nality  of  the  prefilter  bank  Ai(uj)  for  l  =  1,2; 

3)  the  lowpass  property  of  the  combined  filters  Hi( u)  for 
/  =  1,2  of  Ai{cj),  l  =  1,2,  and  H(c o); 

4)  the  highpass  property  of  the  combined  filters  Gi(u)  for 
Z  =  1,2  of  j4,(u/),  Z  =  1,2,  and  G(u;). 


<h(0)  -  ^2^(0)  =  0. 


(3.15) 
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As  mentioned  earlier,  the  zeroth-order  prefilter,  i.e.,  without 
any  term  Ur{u)  in  (3.18),  does  not  satisfy  the  above  property 
3),  although  it  satisfies  all  the  rest,  i.e.,  1),  2),  and  4).  Notice 
that  the  above  zeroth-order  prefilter  was  first  used  in  [16] 
and  [17].  When  the  order  p  of  a  prefilter  increases,  better 
lowpass  and  highpass  combined  filters  Hi(w)  and  G,M, 
respectively,  may  be  expected,  and  the  length  of  a  prefilter 
also  increases.  The  final  version  of  the  two  prefilters  in  (3.18) 
can  be  expressed  as 

"'<">  =  |si"2"-575si”29 

+  (§^+m sin29H"  (3'20, 

A2(w)  =  -  -^=  sin  29  +  -^=cos20 

+  <3'21) 


D.  Another  Design  Example 

The  second  example  of  2  wavelets  is  obtained  by  Chui  and 
Lian  in  [20].  The  matrix  impulse  responses  are 


Hc,=  K-/?/4  -jfl*)' 

*-!($«  -X) 


The  multiscaling  and  multiwavelet  functions  are  supported  in 
[0. 2]  and  have  symmetry  and  certain  smoothness.  It  is  clear 
that 


«(°t=Ko 


.  1(0 


g<0>  =  KS 


44 


Conditions  (3.9)  and  (3.5)  imply  that  Ai(0)  =  ±1  and 
42(0)  =  0.  In  this  case,  the  zeroth-order  (Ai(u>),  A2(uj))  = 
(>11(0),A2(0))  =  (1.0)  already  satisfies  (3.8).  As  pointed 
out  by  one  of  the  referees  of  this  manuscript,  this  result 
holds  not  only  for  the  above  Chui-Lian  multiwavelets  but  for 
other  multiwavelets  as  well  as  long  as  one  of  two  scaling 
functions  is  symmetric  and  the  other  of  two  scaling  functions 
is  antisymmetric. 


E.  Numerical  Simulations  for  the  Combined 
Filters  Hi(uj)  and  Gi(u>) 

In  this  section,  we  want  to  illustrate  the  combined  filters 
Hi(lj)  and  Gt{ u>)  for  /  =  1,2  for  the  GHM  2  wavelets. 
Three  sets  of  these  combined  filters  are  illustrated:  without 
prefiltering  [Fig.  6(a)  and  (b)];  old  zeroth-order  orthogonal 


frequency 

(b) 


Fig.  6.  Combined  filters  of  the  GHM  2  wavelets  without  prefiltering,  (a) 
(b)  |G/M|. 

prefiltering  in  [1]  [Fig.  7(a)  and  (b)];  new  orthogonal  prefilter¬ 
ing  in  Theorem  2  [Fig.  8(a)  and  (b)]. 

IV.  Numerical  Experiments 

In  this  section,  we  want  to  see  the  performance  of  our  new 
prefiltering  scheme  through  some  simple  numerical  examples. 
The  first  test  signal  is  the  one  hundredth  horizontal  line  of 
the  Cameraman  image  with  size  256  x  256,  which  is  shown  in 
Fig.  9.  Six  experiments  on  energy  compaction  of  the  following 
six  transforms  are  done.  The  first  transform  T\  is  the  GHM 
2  wavelets  without  prefiltering.  The  second  transform  T%  is 
the  GHM  2  wavelets  with  the  old  zeroth-order  orthogonal 
prefiltering  with  £i  =  l/(10\/3)  and  €2  =  7/ (5 1/6)  in  (3.29) 
in  [1].  The  third  one  T3  is  the  Daubechies  £>4  wavelets.  The 
forth  and  the  fifth  are  the  GHM  2  wavelets  with  our  new 
orthogonal  prefiltering  of  the  zeroth  and  the  first  order,  respec¬ 
tively.  The  sixth  transform  T$  is  the  Chui-Lian  multiwavelet 
transform  in  Section  III-D  with  the  zeroth-order  prefiltering 
(Afyw),  A2(w))  =  (1,0).  Two  step  decompositions,  i.e.,  Jo  = 
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frequency  -  -  frequency 


Fig.  7.  Combined  filters  of  the  GHM  2  wavelets  with  the  old  zeroth-order  Fig.  8.  Combined  filters  of  the  GHM  2  wavelets  with  their  first-order 
orthogonal  prefiltering  in  { I ].  (a)  \H[{u>)\.  (b)  |G/(u;)|.  orthogonal  prefiltering,  (a)  |#/(u?)|.  (b)  |G/(u;)|. 


-2  and  J  =  0,  in  the  first  three  transforms  are  performed, 
where  the  iowpass  part  of  the  transformed  signal  is  of  length 
64,  whereas  the  bandpass  part  is  of  length  192.  Since  our  new 
prefiltering  is  nonmaximally  decimated  and  the  signal  size  in 
the  discrete  multiwavelet  transform  domain  is  twice  of  the 
input  signal  (or  the  output  signals  of  the  first  three  transforms), 
three  step  decompositions,  i.e.,  J0  =  -3  and  J  =  0,  of  the 
discrete  multiwavelet  transform  with  our  new  prefiltering  are 
performed  for  the  last  three  transforms*  where  the  length  of 
the  Iowpass  part  of  the  transformed  signal  is  also  64,  whereas 
the  length  of  the  bandpass  part  is  512  -  64  =  448.  Therefore, 
we  have  the  following  energy  compaction  ratio  definitions. 

The  energy  compaction  ratioes  for  the  first  three  transforms 
Tk  for  k  =  1,2,3  are  defined  by 

r_  ESsM 

£n=i  lyNI2 


original  signal 


Fig.  9.  First  test  signal. 
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decomposition  by  muttiwavelet  transform 


Fig.  10.  Decomposition  of  the  first  test  signal  using  the  GHM  2  wavelets 
without  prefiltering. 


decomposition  by  muttiwavelet  transform 


Fig.  1 1.  Decomposition  of  the  first  test  signal  using  the  GHM  2  wavelets 
with  the  old  zeroth -order  orthogonal  prefiltering  in  [1]. 


TABLE  I 

Energy  Compaction  Ratio  Comparison  for  the  First  Test  Signal 


r 

GHM  2  wavelets  without  prefiltering 

0.1374 

GHM  2  wavelets  with  the  old  0th  order  orthogonal  prefiltermg  in  [I] 

0.1247 

Daubechies  DA  wavelets 

0.1123 

GHM  2  wavelets  with  the  new  0th  order  orthogonal  prefiltering 

0.0896 

GHM  2  wavelets  with  the  new  Ith  order  orthogonal  prefiltering 

0.0722 

Chui-Lian  2  wavelets  with  the  0th  order  orthogonal  prefiltering  j 

0.0944 

where  y[n]  are  the  signals  in  the  transform  domain.  The  energy 
compaction  ratioes  for  the  rest  three  transforms,  i.e.,  with  the 
new  prefiltering,  are  defined  by 

_  SL265  bWI2 
£n=i  bMI* ' 

The  transformed  signals  with  the  first  three  transforms  are 
shown  in  Figs.  10-12,  respectively.  The  transformed  signals 
with  the  new  orthogonal  prefiltering  of  the  zeroth-  order 
and  the  first-order  for  the  GHM  multiwavelets  are  shown  in 
Figs.  13  and  14,  respectively.  The  transformed  signal  with 


decomposition  by  Daubechies  D4  transform 


Fig.  12.  Decomposition  of  the  first  test  signal  using  Daubechies  DA 
wavelets. 


decomposition  by  muttiwavelet  transform 


TABLE  II 

Energy  Compaction  Ratio  Comparison  for  the  Second  Test  Signal 


r 

Daubechies  Da  wavelets 

0.0110 

GHM  2  wavelets  with  the  new  1th  order  orthogonal  prefiltermg 

0.0071 

Chui-Lian  2  wavelets  with  the  0th  order  orthogonal  prefiltermg 

0.0065 

the  zeroth-order  orthogonal  prefiltering  for  the  Chui-Lian 
multiwavelet  is  shown  in  Fig.  15.  Their  energy  compaction 
ratioes  are  listed  in  Table  I. 

The  second  test  signal  is  the  two  hundred  and  fiftieth 
horizontal  line  of  the  Einstein  image  with  size  256  x  256. 
The  original  signal,  the  transformed  signal  with  transform  T3 
(Daubechies  D4  wavelets),  the  transformed  signal  with  trans¬ 
form  T5  (the  GHM  2  wavelets  with  the  first-order  orthogonal 
prefiltering),  and  transform  signal  with  transform  Tq  (the  Chui- 
Lian  2  wavelets  with  the  zeroth-order  orthogonal  prefiltering) 
are  shown  in  Figs.  16-19.  Their  energy  compaction  ratioes  are 
listed  in  Table  II  with  the  same  definitions  as  above. 
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Fig.  14.  Decomposition  of  the  first  test  signal  using  the  GHM  2  wavelets  Fig.  17.  Decomposition  of  the  second  test  signal  using  Daubechies  D4 
with  the  new  first-order  orthogonal  prefiltering.  wavelets. 


Fig.  15.  Decomposition  of  the  first  test  signal  using  the  Chui-Lian  2  wavelets 
with  the  new  zeroth -order  orthogonal  prefiltering. 


Fig.  18.  Decomposition  of  the  second  test  signal  using  the  GHM  2  wavelets 
with  the  new  first-order  orthogonal  prefiltering. 


A  better  energy  compaction  with  the  new  orthogonal  pre- 
filter  than  with  others  can  be  seen  from  the  above  tables. 

V.  Conclusion 

In  this  paper,  we  have  introduced  a  new  prefilter  design  tech¬ 
nique  for  discrete  multiwavelet  transforms.  The  new  technique 
is  based  on  approximating  a  function  with  the  lowpass  property 
and  the  orthogonality  of  their  translations  by  using  linear  com¬ 
binations  of  multiscaling  functions  and  their  translations.  The 
new  prefiltering  is  orthogonal  but  not  maximally  decimated.  It 
deals  with  all  decomposition  steps  for  discrete  multiwavelet 
transforms,  whereas  the  prefiltering  in  [1]  only  focuses  on 
the  first  step  decomposition.  The  decimation  nonmaximality 
allows  one  to  have  more  freedom  in  designing  a  prefilter  so 
that  more  desired  conditions  on  the  prefilters  and  the  combined 
filters  of  the  prefilters  and  multiwavelet  vector  filters  are 
satisfied.  Our  numerical  examples  show  that  a  better  energy 
compaction  ratio  with  the  GHM  2  wavelets  and  the  Chui-Lian 


Fig.  16.  Second  test  signal. 
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When  /  is  bandlimited  with  bandwidth  2Jn,  the  error  can  be 
simplied  as 

|2 

dt 


! 


/(t)-^6n2^(2Jt-n)| 


Notice  that  <£(0)  =  1.  When  J  is  large  enough  and  the 
bandwidth  W  of  the  signal  /  is  much  smaller  than  2j7t,  i.e., 
W  <  2J7r,  then 

|2 

dt 

,  t  \—  -  /  j 

n 


Fig.  19.  Decomposition  of  the  second  test  signal  using  the  Chui-Lian  2 
wavelets  with  the  new  zeroth -order  orthogonal  prefiltering. 


//(tj-Ew'W-) 

This  is  because  d>(ui/2J)  «  1  for  large  J  for  \uj\  <W. 


2  wavelets  with  the  new  orthogonal  prefiltering  than  the  one 
with  the  Z)4  wavelet  transform  is  achieved.  This  suggests  the 
potential  applications  of  discrete  multiwavelet  transforms  in 
image  compression/denoising. 

It  is  known  that  any  nonredundant  orthogonal  transform 
keeps  the  energy.  For  example,  the  error  energy  after  the 
quantization  in  the  transform  domain  in  the  compression  is 
equal  to  the  error  energy  in  the  reconstruction  domain  in 
the  decompression.  This  no  longer  holds  for  the  redundant 
prefiltering/postfiltering  studied  in  this  paper.  In  the  case 
when  the  quantization  errors  are  random,  it  can  be  easily 
shown  that  the  error  energy  in  the  reconstruction  domain  in 
the  decompression  is  one  fourth  of  the  error  energy  in  the 
transform  domain  in  the  compression. 


Appendix 


The  error 


/(t)-£6n2-"V(2  Jt-n) 


where 

bn=  J  f{t)2J/2<f>(2Jt-n)dt 

can  be  estimated  as  follows.  In  the  Fourier  transform  domain, 
the  L 2  error  can  be  expressed  as 


/ 


f(t)-^2bn2J/2<(>(2Jt-n) 


■Ml 


dt 


-  4>(v)  ^  4>(—u>  +  2n7r) 


x  f(2J(uj  -  2n7r)) 


doj. 
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Orthonormal  Matrix  Valued  Wavelets  and 
Matrix  Karhunen-Loeve  Expansion 
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ABSTRACT.  In  this  paper,  we  study  orthonormal  matrix  valued  wavelets  for 
analyzing  matrix  (vector)  valued  signals  based  on  matrix  multiresolution  anal¬ 
ysis.  We  present  a  simple  sufficient  condition  on  the  matrix  filter  H(u/)  that 
leads  to  orthonormal  matrix  valued  wavelets.  The  sufficient  condition  is  analo¬ 
gous  to  the  one  given  by  Mallat  for  scalar  valued  wavelets.  The  components  at 
each  column  of  matrix  valued  wavelets  form  multiwavelets  for  a  scalar  valued 
signal,  where  the  orthonormality  induced  from  the  orthonormal  matrix  valued 
wavelets  is  weaker  than  the  one  in  the  current  literature  on  orthonormal  multi¬ 
wavelets.  With  the  new  orthonormality,  one  is  able  to  construct  orthonormal 
matrix  valued  wavelets  similar  to  the  conventional  multiresolution  analysis 
based  orthonormal  wavelets.  Moreover,  we  show  that  the  new  orthonormality 
provides  a  complete  Karhunen-Loeve  decomposition  for  matrix  valued  signals. 


1.  Introduction 

While  wavelets  and  multiwavelets  have  been  extensively  studied  lately  for  a 
scalar- valued  signals,  see  for  example  [1]-[17],  there  are  only  a  few  researches,  [1], 
on  matrix  (vector)  valued  wavelets  for  matrix  (vector)  valued  signals.  In  practice,  it 
is  however  often  to  encounter  matrix  (vector)  valued  signals,  such  as  video  images, 
multi-spectral  images  and  color  images.  A  significant  difference  between  matrix 
(vector)  valued  signals  and  scalar  valued  signals  is  that  there  are  correlations  for  a 
matrix  (vector)  valued  signal  not  only  in  the  time  domain  but  also  between  its  com¬ 
ponents  (or  the  spatial  domain)  at  a  fixed  time  while  there  is  correlation  for  a  scalar 
valued  signal  only  in  the  time  domain.  The  aim  of  the  construction  of  orthonormal 
matrix  valued  wavelets  is  to  decorrelate  a  matrix  (vector)  valued  signal  in  both  the 
time  and  the  spatial  domains.  As  a  side  result,  the  components  at  each  column 
of  orthonormal  matrix  valued  wavelets  also  form  multiwavelets  for  scalar  valued 
signals.  We  will  see  later  that  the  orthonormality  for  the  multiwavelets  generated 
from  orthonormal  matrix  valued  wavelets  is  weaker  than  the  orthonormality  in  cur¬ 
rent  literature  on  orthonormal  multi  wavelets,  [4]-[15].  In  [1],  orthonormal  matrix 
(vector)  multiresolution  analysis  was  introduced  for  the  purpose  of  constructing 
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orthonormal  matrix  valued  wavelets.  However,  the  theory  in  [1]  is  not  complete  in 
the  continuous  time  case  in  the  sense  that  there  is  not  a  simple  sufficient  condition 
on- the  matrix  quadrature  mirror  filter  (MQMF)  H(u;)  that  leads  to  orthonormal 
matrix  valued  wavelets. 

In  this  paper,  we  first  re-introduce  matrix  valued  signal  spaces  and  matrix 
valued  multiresolution  analysis  studied  in  [1].  We  then  present  a  simple  sufficient 
condition  on  the  MQMF  H(w)  for  constructing  orthonormal  matrix  valued  wavelets, 
which  basically  proves  the  conjecture  proposed  in  [1].  A  connection  between  or¬ 
thonormal  matrix  valued  wavelets  and  orthonormal  multiwavelets  in  the  current 
literature  is  studied.  It  can  be  seen  that  the  orthonormality  for  the  multiwavelets 
induced  from  the  orthonormality  of  orthonormal  matrix  valued  wavelets  is  weaker 
than  the  orthonormality  for  multiwavelets  in  the  recent  literature  in  the  continu¬ 
ous  time  waveform  case,  see  for  example  [4]-[15],  while  they  are  the  same  in  the 
discrete  time  filterbank  case.  The  weaker  orthonormality  in  the  continuous  time 
case  provides  a  weaker  sufficient  conditon  for  constructing  multiwavelets  with  this 
weaker  orthonormality. 

In  the  second  part  of  this  paper,  we  show  that  the  orthonormality  studied  in  this 
paper  for  matrix  valued  signals  gives  a  complete  Karhunen-Loeve  decomposition  for 
matrix  valued  signals,  i.e.,  this  orthonormality  provides  a  complete  decorrelation 
for  a  matrix  valued  signals  in  both  the  time  and  the  spatial  domains. 

2.  Matrix  Valued  Signal  Space  and  Multiresolution  Analysis 

For  convenience,  we  only  study  N  x  N  matrix  valued  signals  and  wavelets.  We 
introduce  some  notations  first. 

2.1.  Matrix  Valued  Signal  Space.  Let 
qNxN  =  .  a  is  an  N  x  N  matrix  with  entries  in  the  complex  plane  C}, 

and 

L2(a,b-,CNxN)  =  {f(t)  =  (. fk,i(t))NxN  ■■  /*,«(<)  €  L2(a,b),l  <  k,l  <  N} . 

The  signal  space  L2(a,b-,CNxN)  is  called  a  matrix  valued  signal  space.  When 
a  =  -oo  and  b  =  oo,  L2{a,b;CNxN)  is  also  denoted  by  L2(R,CNxN). 

For  any  A  €  CNxN  and  f  €  L2(a,  b ;  CNxN),  the  products 

Af,  fA  G  L2(a,b;  CNxN). 

This  implies  that  the  matrix  valued  signal  space  L2(a,b\CNxN)  is  defined  over 
C'VxV  and  not  simply  over  C. 

Let  ||  -  || Af  denote  a  matrix  norm  on  CNxN .  For  each  f  E  L2(a1b;CNxN)1  ||f|| 
denotes  the  norm  of  f  associated  with  the  matrix  norm  ||  •  \\m  as 

(2.i)  m  =  (£\m\\l,dt 

For  f  6  L2(a,b-, CNxN),  its  integration  f  f{t)dt  is  defined  by  the  integration  of  its 
components. 

For  two  matrix  valued  signals  f,  g  E  L2(a,  6;  CNxN ),  (f,  g)  denotes  the  integra¬ 
tion  of  the  matrix  product  f(^)gt(^): 
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where  *  denotes  the  conjugate  transpose.  For  convenience,  we  still  call  the  operation 
(  ,  )  in  (2.2)  inner  product  although  it  is  not  the  inner  product  in  the  common  sense. 
With  the  definition  (2.2)  it  is  clear  that  (f,g)  —  (g,  f)t. 

A  sequence  $k(t)  E  L2(a,b;CNxN),  k  E  Z,  is  called  an  orthonormal  set  in 
L2(a,6;CNxN)  if 

(2.3)  <**,*,)  =  tf(fc-l)/*,  Me  Z, 

where  6(k)  =  1  when  k  =  0  and  <5(fc)  =  0  when  A:  ^  0  and  In  is  the  N  x  N  identity 
matrix.  A  sequence  $*(£)  E  L2(a,6;  CNxN),  k  e  Z,  is  called  an  orthonormal  basis 
for  L2(a,fc;CNx^)  if  it  satisfies  (2.3),  and  moreover,  for  any  f(£)  £  L2(a,b;CNxN) 
there  exists  a  sequence  of  N  x  N  constant  matrices  F *  such  that 

(2.4)  f(t)  =  £F***(t),  forte  [a, 6], 

Jtez 

where  the  multiplication  Ffc$fc(£)  for  each  fixed  £  is  the  TV  x  N  matrix  multiplication, 
and  the  convergence  for  the  infinite  summation  is  in  the  sense  of  the  norm  ||  •  || 
defined  by  (2.1)  for  the  matrix  valued  signal  space. 

2.2.  Matrix  Valued  Multiresolution  Analysis.  We  next  define  matrix  val¬ 
ued  multiresolution  analysis,  which  is  similar  to  the  conventional  multiresolution 
analysis. 

A  matrix  valued  multiresolution  analysis  (MMRA)  of  L2(R,  CNxN)  is  a  nested 
sequence  of  closed  subspaces  Vj,  j  £  Z,  of  L2(R,  CNxN)  such  that 

(i) .  Vj  C  Vj+i,  j  £  z, 

(ii) .  is  dense  in  L2(R,  CNxN)  and  Oj€z Vj  =  {0},  where  0  is  the  all 

zero  matrix, 

(iii) .  f(£)  £  Vj  if  and  only  if  f(2£)  E  Vj+i,  j  £  Z, 

(iv) .  There  is  a  $  £  V0  such  that  its  translations  <£(£  —  k),  k  £  Z,  form  an 

orthonormal  basis  for  Vo- 

The  above  definition  for  an  MMRA  is  notationally  similar  to  the  one  for  the 
conventional  multiresolution  analysis  (MRA).  We  call  $(£)  a  matrix  valued  scaling 
function  (or  simply  scaling  function)  for  the  MMRA  {Vj}.  Since  $(t)  £  V0  C  Vi, 
there  exist  constant  N  x  N  matrices  Hk,  k  £  Z,  such  that, 

(2.5)  $(f)  =  2^Hk<t>{2t-k). 

k 

Let 

(2.6)  H(ui)  =  J2Hke~iku. 

k 

Then, 

(2.7)  =  H(|)*(|)  =  H(|)H(|)  •  •  •  4(0), 

where  it  is  assumed  that  <f>(u>)  is  continuous  at  lj  =  0.  This  assumption  is  satisfied 
when  H(u/)  has  only  finite  terms  and  H(0)  =  In-  In  this  paper,  for  convenience 
we  assume  3>(0)  =  In ,  which  makes  an  important  difference  between  matrix- valued 


162 


XIANG-GEN  XIA 


wavelets  and  multiwavelets  from  the  matrix  scaling  equation  or  refinable  equation 
point  of  view.  By  this  assumption, 

CO 

(2.8)  $(w)  =  H(|)H(^)---  = 

Z  k= 1 

The  equation  (2.7)  implies 

(2.9)  H(0 )  =  In,  or  ^THk=  In- 

k 

It  is  not  hard  to  see  that  the  orthonormality  of  $(t  -  k),  k  e  Z,  (or  the 
orthonormality  of  MMRA  {V,  })  is  equivalent  to 

(2.10)  y~^4(a;  +  27rfc)‘lt(cJ  +  2nk )  =  2ttIn >  Vu>  €  R. 

k 

In  terms  of  the  filter  HM,  the  above  orthonormality  implies 

(2.11)  H^H^u;)  +  H(a;  +  +  tt)  =  In,  Va;  E  R. 

The  orthonormality  (2.10)  is  in  the  continuous  time  domain  for  continuous-time 
waveforms  while  the  one  (2.11)  is  in  the  discrete  time  domain  for  discrete-time 
filterbanks. 

Assume  we  have  the  above  MMRA  and  H(w).  We  now  want  to  construct  its 
corresponding  matrix  valued  wavelets  that  form  an  orthonormal  basis  for  the  whole 
matrix  valued  signal  space  L2( R,  CNx/v). 

Let  G(u>)  satisfy 

(2.12)  G^H^u;)  +  G(u>  +  7r)Ht(a;  +  n)  =  0,  Vo;  E  R, 
and 

(2.13)  G(a;)Gt(u;)  +  G(u;  +  7r)Gt(a?  +  7r)  =  In ,  Vcj  E  R. 

Let 

(2.14)  *(w)  =  G(|)*(|). 

The  following  result  was  proved  in  [1]. 

Theorem  2.1.  Let  <3 >(i)  be  the  matrix  valued  function  with  its  Fourier  trans¬ 
form  defined  in  (2.14).  Then,  its  translations  V(t-k),  k  €  Z,  form  an  orthonormal 
basis  for  Wo  =  Vj  ©  V0.  Thus,  9jfk(t)  =  2^g>(2 H  -  k),  j,k  €  Z,  form  an  or¬ 
thonormal  basis  for  the  matrix  valued  signal  space  L2(R,  C  x  ). 

The  matrix  filters  H(w)  and  G(w)  in  (2.1 1)-(2.13)  are  called  matrix  quadrature 
mirror  filters  (MQMF).  Given  H(u),  G(w)  can  be  constructed  by  the  following 
method. 

Let  H(ui)  =  (HM.H^  +  tt))!  and  6(u>)  =  (G(w),G(w  +  tt))*.  Then,  the 
orthogonality  (2. 1 1  )-(2. 13)  is  equivalent  to  the  paraunitariness  of  the  2N  x  2 TV  ma¬ 
trix  (H(w),  G(u>)).  Let  Hj(w)  and  G_,(u;)  for  j  =  0, 1  be  the  polyphase  components 
of  H(u>)  and  G(w),  respectively:  F(w)  =  Fo(2u>)  +  e-,u,Fi(2u;),  where  F  is  H  or 
G.  Then,  the  above  paraunitariness  is  equivalent  to  the  paraunitariness  of  the 
matrix  (H(u>),G(w)),  where  F(o>)  =  (F0(w), Fi(a))t  for  F  =  H  or  G.  Thus,  the 
construction  of  G (J)  in  (2.11)-(2.13)  is  equivalent  to  the  completion  of  a  27V  x  27V 
paraunitary  matrix  given  its  first  TV  orthogonal  columns  H(u>).  This  completion 
can  be  obtained  by  employing  the  state-space  description,  see  for  example  [20]- [22], 
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where  only  the  orthogonal  completion  of  a  constant  orthogonal  matrix  is  needed 
for  the  corresponding  constant  realization  matrix. 

In  the  next  section,  we  want  to  construct  orthonormal  matrix  valued  scaling 
functions  $(f)  from  the  orthogonal  filter  H(w)  in  (2.11). 


3.  Construction  of  Matrix  Valued  Wavelets 

It  is  known  that  the  conventional  scaling  functions  or  MRA  can  be  constructed 
from  QMF  H(u)  and  necessary  and  sufficient  conditions  have  been  obtained,  [18]- 
[19].  For  matrix  valued  wavelets,  we  present  the  following  results.  We  first  present 
a  lemma.  In  what  follows,  we  are  only  interested  in  FIR  MQMF  H(w),  i.e.,  H(u>) 
is  a  polynomial  matrix  of  e~iu. 

Lemma  3.1.  Let  H(w)  satisfy  (2.9)  and  (2.11).  If  there  exist  a  constant  C>  0 
and  an  integer  K0  such  that  for  any  lj  <E  {-2Kn,2Kn)  and  any  K  >  K0, 

K  oo 

(3i)  nnH(|)iiM^iin<)iiM, 

j=i  z 

then,  the  solution  $(t)  in  the  matrix  dilation  equation  (2.5)  is  a  matrix  valued 
scaling  function  for  an  MMRA. 

Proof.  The  assumption  of  the  FIR  property  on  H(u)  leads  to  the  finiteness 
of  the  right  hand  side  of  (3.1).  To  prove  Lemma  3.1  we  only  need  to  prove  the 
orthonormality  of  <&(t  —  A;),  k  €  Z.  The  rest  is  similar  to  the  conventional  MRA 
theory,  see  for  example  [19]. 

For  an  integer  K  >  0,  let 

K 

Ma(w)  =  nH(o?)^|-2K*,2't!r)(W)' 

Z=I  z 

Then, 


[  u)e  xriu}duj 

Jr 

£.><!>■ ■“<£>«’<£> 

£  "  n  “(?)  +  H(^  +  *)H'<£  +  „) 


K-  1  \  t 

rr  — ,  u)  „  l 


n  H(5y)  e-^du, 


by  (2.ii) 


J^K-\(uj)fi\(_l(u)e  tnuJdu~----  J  einu,djjIN  = 


2n 


It  is  clear  that  Hk{u)  converges  to  $(u>)  pointwisely  in  (2.8)  since  H(0)  =  IN  and 
H(o>)  is  a  polynomial  matrix  of  e~iu>.  By  (3.1), 

II Ma(w)/4(u>)  -  $(w)$t(fc,)||M  <  (C+  l)||$(w)4t(w)||M,  Vw  e  R. 
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By  the  dominated  convergence  theorem,  we  have  || HkUk  -  $$*11  -»  0  as  K  —  oo. 
Therefore, 

J  -  n)dt  =  J  $(us)&{u))e~tnuJ(L> 

=  —  lim  f  iAK(u)l*j((w)e  tnuidoj  =  6(ti)In ■ 

27 T  K—*oc  JR 

This  proves  the  orthonormality  of  $(t  —  k),  k  £  Z.  * 

We  next  want  to  present  a  sufficient  condition  on  H(u>)  so  that  (3.1)  is  satisfied. 

Lemma  3.2.  Let  H(u>)  be  a  polynomial  matrix  of  e~iw  and  H(0)  =  In-  Then, 
there  exist  an  integer  Kq  and  u  constant  C  >  0  such  that 

nnH(j)iiM<ciinH(|)»*<' 

/=1  1=1 

for  uj  £  (— 7r,7r)  and  K  >  K$. 

Proof.  Since  H(w)  is  a  polynomial  matrix  of  e-’"  and  4>(0)  =  In,  we  have 

OO 

fc= 1 

and 

lim  ||^(^)  -  In\\m  —  0. 

u)—*  0 

Thus,  there  exists  an  integer  Kq  >  0  such  that,  for  k  >  K0  and  |u>|  <  7t/2, 

“  InWm  -  e’ 
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Proof.  For  u  E  (—2K7r,2K7r),  if  k  >  K,  then  u)/2k  E  (-7r/2,  7t/2).  By  the 
proof  of  Lemma  3.2,  for  uj  E  (-2 kti,  2Ki r), 


o°  \  1  1 

n  «<s>  iim  s  t 


V=/C+iCo  +  l  / 

Let  us  consider  the  case  of  l  E  {K  -f  1,  K  +  2, K  +  /fo}.  Let  6  >  0  such  that 
inf|w|<ir/2  |A(uj)|  >  <5  for  all  eigenvalue  functions  of  the  polynomial  matrix  H(u;). 
Then,  (A(u>))-1  is  an  eigenvalue  function  of  the  function  matrix  (H(u;))-1  of  variable 
e~lu;  for  \uj\  <  7r/2.  Thus,  there  exists  positive  constant  C\,  which  only  depends  on 
6 ,  such  that,  for  |cj|  <  tt/2, 

ll(H(u7))“1||Af  <  Cx. 

Therefore,  for  any  u  E  (—2K7r,  2K7t), 

K  oo  /  oo  \  -1  / K+Ko  \  ~l 

-  ii n<)  n  <}  n  <>) 


l=K+K  o  +  l 


C|inH(^)||; 


where  C  =  C^°/e.  4 

By  combining ‘the  above  three  lemmas,  we  have  proved  the  following  result. 

Theorem  3.4.  Let  H(u)  be  a  polynomial  matrix  of  e~lu  and  satisfy  (2.9)  and 
(2.11).  If 

inf  |A(a/)|  >0 

\u>\<tt/2 

for  any  eigenvalue  function  A  (a;)  of  the  polynomial  matrix  H(u>)  of  variable  e~tu;, 
then ,  the  solution  4>(£)  in  the  matrix  dilation  equation  (2.5)  is  a  matrix  valued  scal¬ 
ing  function  for  an  MMRA,  and  therefore  j,k  E  Z,  form  an  orthonormal 

basis  for  the  matrix  valued  signal  space  L2(R,  CNx N). 

Notice  that  the  above  sufficient  condition  is  analogous  of  the  one  given  by  Mallat 
[18].  With  the  above  sufficient  condition,  it  is  not  hard  to  construct  nontrivial 
families  of  orthonormal  matrix  valued  wavelets.  The  following  is  an  example. 

It  is  not  hard  to  show  that,  if  H(u;)  =  ^(Ijv+etuJE(2uj))  and  E(u;)  is  paraunitary, 
i.e.,  E(u;)E*(u;)  =  1^,  then  G(o;)  =  e“*lu;Ht(u/+ n)  and  H(u>)  form  a  pair  of  MQMF 
satisfying  (2.11)-(2.13).  Such  property  for  H(a;)  is  called  the  sampling  property  in 
[1].  Let  E(lj)  =  U(u;)diag(e^t/ciu;,  *  •  •  ,  e~tkN*)TJ*  (lj)  for  kj  =  0  or  1,  where  U(cj) 
is  an  arbitrary  paraunitary  polynomial  matrix  and  U(0)  =  1^.  Then,  it  is  not 
hard  to  see  that  the  above  H(^)  and  G(u;)  satisfy  (2. 1 1)-(2. 13)  and  the  sufficient 
condition  in  Theorem  3.4. 

4.  Connection  to  Multiwavelets 

Let  ( $(t))ik ,  and  (Vj)ik  be  the  components  at  the  Zth  column  and  kth 

row  of  $(£),  ®(Z)  and  V^,  respectively,  lyk  —  1,2, ...,  N  and  j  E  Z.  Then, 

(Vj)tk  c  (Vi+X)ifc,  and  f(t)  E  (V,),*  <=►  f(2t)  E  (Vi+1)ifc, 


njez(vj)'fc  =  {°}-  and  Uj€Z  (Vj)ifc  is  dense  in  L2(R). 
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Moreover,  for  any  fik  €  (Vo)ik,  there  exist  constants  akum,i,k  such  that 

N 

*(4.1)  flk{t)=  ^  Y,  ~  t  G  R. 

fc i € Z  m=l 

And,  for  any  /  £  L2(R),  there  exist  constants  ajtkl,i,k  such  that 

N 

(4.2)  f(t)  =  Y  ^aj,fcl,i1fc('I,jk1(t))jfc,  t  €  R, 

ez  i=i 

where  fc  is  any  integer  with  1  <  k  <  N.  This  imphes  the  following  proposition. 

Theorem  4.1.  Let  <&(f)  be  a  matrix  valued  scaling  function  of  an  MMRA  {Vj} 
and  'f'(i)  be  its  an  associated  matrix  valued  wavelet  function.  Then,  for  any  fixed 
k,  1  <  k  <  N,  the  functions  (${t))ik,  l  =  1,2,...,  AT,  form  multiscaling  functions 
and  (V(t))ik,  l  =  1,2,  ...,AT,  form  multiwavelets.  Moreover,  for  each  pair  ( l,k ),  the 
spaces  (Vj)ik,  j  €  Z,  form  a  multiresolution  analysis  of  multiplicity  rk  where  rk  is 
the  maximum  number  of  linearly  independent  functions  of  (<t>(t))ik,  l  =  1,2, ...,  N. 

For  more  about  multiresolution  analysis  of  multiplicity  r,  see  [2]-[3].  We  next 
want  to  study  the  orthonormality  of  the  column  multiscaling  functions  induced 
from  the  orthonormality  for  matrix  valued  scaling  functions,  which  is 


(4.3) 

Or, 

(4- 


N  r 

£7<*<»- 

m=l  J 


Tl))lm($*(t  ~  T2))kmdt  =  6{tX  ~  T2)6{1  ~  k). 


N  . 

4)  [($(t-Tl))lk($'(t-T2))kkdt+  (Ht-Tl))lm(**(.t-T2))kmdt 

J  m=\,m^kJ 


=  S(ri  -  t2)6{1  -  k). 

Consider  the  multiscaling  functions  from  the  kth  column  {$(t))ik{t)y  1  ^  J  — 
/V,  of  4>(£).  The  conventional  orthogonality  studied  in  the  current  literature  for 

multi  wavelets  is 

(4.5)  f($(t  -  Ti))ilfc($*(t  -  r2))hkdt  =  <5(ri  -  T2)6(h  -  l2). 

We  call  the  orthogonality  (4.5)  Orthogonality  A,  and  the  orthogonality  (4.4)  Or- 
thogonality  B ,  for  multiscaling  functions  ($(t))ik{t),  1  <1  <  N.  One  can  see  that 
the  second  term  in  the  left  hand  side  of  (4.4),  Orthogonality  B,  is  the  flexibility 
term  over  (4.5),  Orthogonality  A. 

Lemma  4.2.  The  conventional  Orthogonality  A  for  all  column  vectors  of  a  ma¬ 
trix  valued  scaling  function  implies  Orthogonality  B  induced  from  the  orthogonality 
for  matrix  valued  scaling  functions. 

Proof.  To  prove  (4.4),  we  only  need  to  prove  (4.3),  which  is 
Y  [ (^(«-n))im(«,*(f-r2))fcmd<  <4=5)  Y  6(n-T2)6(l-k)  =  N6(t\  —T2)S(l  —  k). 

m~\  m=l 


Comparing  Orthogonality  A  in  (4.5)  and  Orthogonality  B  in  (4.4)  or  (4.3),  one 
can  see  that  the  former  requires  the  orthogonality  for  each  individual  component 
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in  a  vector  while  the  later  only  needs  the  orthogonality  for  the  vector  itself.  This 
implies  that  Orthogonality  B  is  weaker  than  Orthogonality  A.  On  the  other  hand, 
these  two  orthogonalities  imply  the  same  orthogonality  (2.11)  for  the  discrete  matrix 
filterbank  H(u;). 

We  now  consider  a  subspace  of  L2(R,  CNxN): 

L2(R,CN )  =  {f  =  (fkAt))N*N  e  L2(R,CNxN)  :  /*,,(*)  =  0  for  2  <  l  <  N}, 

which  is  isomorphic  to  the  N  x  1  vector  valued  signal  space.  We  may  define  its 
corresponding  MAR,  scaling  functions,  wavelet  functions  similarly.  In  this  case, 
$(t)  =  ((${t))ki)NxN  with  ($(£))*<  =  0  for  2  <  l  <  N.  Clearly,  Orthogonality 
A  and  Orthogonality  B  are  equivalent  in  this  case.  In  other  words,  Orthogonality 
A  only  corresponds  to  Orthogonality  B  in  a  proper  subspace  of  the  matrix  valued 
signal  space. 

With  Orthogonality  A,  necessary  and  sufficient  conditions  on  H(u/)  that  leads 
to  orthogonal  multiwavelets  have  been  obtained,  see  for  example  [15].  Since  the 
stronger  Orthogonality  A  is  used,  the  necessary  and  sufficient  condition  on  H(u>)  is 
not  easy  to  check  or  use.  However,  with  the  weaker  Orthogonality  B,  the  condition 
on  H(a;)  in  Theorem  3.4  is  much  easier  to  check  so  that  one  is  able  to  use  it  to 
construct  families  of  nontrivial  orthogonal(B)  multiwavelets  as  studied  in  Section 
3.  The  basic  idea  doing  this  is  to  embed  anlVxl  vector  into  aniVxiV  matrix 
and  then  use  the  matrix  orthogonality.  Another  way  to  interpret  this  idea  is  that 
we  lift  a  one  dimensional  vector  into  a  two  dimensional  matrix  with  additional 
freedoms  to  play  with,  which  makes  the  construction  easier.  One  now  might  want 
to  ask  whether  this  new  Orthogonality  B  is  physically  meaningful.  The  answer  is 
yes  because  it  provides  a  complete  decorrelation  for  matrix  valued  signals  as  we 
shall  study  in  the  next  section. 

5.  Matrix  Karhunen  Loeve  Expansion 

In  this  section,  we  show  that  Orthogonality  B  provides  a  complete  decorrelation 
for  matrix  valued  random  processes. 

5.1.  Matrix  KL  Expansion:  Definition.  Let  X(i),  t  e  [a,  b]  with  -oo  < 
a  <  b  <  oo,  be  a  matrix  valued  random  process  with  finite  second  moments,  i.e., 

£(Xf(t)X(t))  €  CNxN, 
and  each  path  X{t)  €  L2{ R\CNxN).  Let 

(5-1)  R(M)  =  ^X^XW),  s,te[a,b). 

If  there  exist  $„(t)  €  L2{a,b;CNxN),  An  €  CNxN,  n  =  1,2,...,  such  that 

(5.2)  [  *n(s)R(s,t)ds  =  A„ *„(<),  n  =  1,2,...,  t  £  [a,  6], 

J  a 

(5-3)  ($n,$m)  =  6{m-n)IN,  m,n=  1,2,..., 

and 

OO 

X(t)  =  ^(X,$n)$„({),  t6[fl,6), 

n=l 


(5.4) 
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then,  the  expansion  of  X(t)  in  (5.4)  is  called  the  matrix  Karhunen-Loeve  expansion 
of  X(f).  If  the  matrix  Karhunen-Loeve  (MKL)  expansion  of  X(t)  exists,  then  X(t) 

is  decorrelated  into  a  matrix  valued  random  sequence  Yn  =  ($„,X)  as 

(5.5)  E(YnYl)=6{n-m)An,  m,n  =  1,2,.... 

The  random  sequence  Y„,  n  =  0,1,2,...,  is  called  the  matrix  Karhunen-Loeve 

transform  of  X(f).  , 

Notice  that  when  N  =  1,  the  above  MKL  expansions/transforms  are  reduced  to 

the  conventional  KL  expansions/transforms.  The  object  of  this  section  is  to  study 

the  MKL  expansion  of  X(f).  . 

Two  special  cases  were  studied  in  [23]-[24].  In  one,  the  constant  matrix  An  in 
(5.2)  was  replaced  by  a  scalar  value  and  in  the  other,  $n(t)  in  (5.2)  was  replaced  by 
a  scalar-valued  function.  As  mentioned  in  §3.7  in  [24],  only  a  few  cases  satisfy  these 
assumptions,  and  therefore  they  are  not  complete.  The  main  reason  for  not  using 
the  product  of  two  matrices  at  the  right  hand  side  in  (5.2)  is  due  to  the  difficulty 
of  handling  the  noncommutativity  of  matrix  products. 


5.2.  The  Generalized  Hilbert-Schmidt  and  Mercer’s  Theorems.  With¬ 
out  loss  of  generality,  in  what  follows  we  assume  a  =  0  and  6  =  T>  0.  Let^K(s,  t), 
s  t  £  fO  T]  be  a  matrix  valued  function  of  two  variables  in  L  (0,  T,  C  ).  n 
other  words,  for  each  s  €  [0,T],  K(s,-)  e  L2(0,T;  CNxN),  and  for  each  t  €  [0,T], 
K(-,t)  £  L2(0,T;CNxN),  and 


[ T  [T  \\K(s,t)W2Mdsdt  <  oo. 

Jo  Jo 


If  K(s,f)  satisfies  the  above  conditions,  then  K (s,t)  is  called  a  matrix  Fredholm 
integral  operator.  It  is  clear  that  a  matrix  Fredholm  integral  operator  K(s,  t)  maps 
L2(0,T;CWxN)  into  itself: 

(Kf)(t)  ±  [T  f(s)K{s,t)ds  6  L2{0,T;Cn*n). 

Jo 

Let  4>(f)  €  L2(0,T;CNxN)  with  <$,$)  =  In,  and  A  €  CNxN.  If  the  following 
identity  holds: 


(5.7)  /%(s)K(s,f)ds  =  A4>(t),  te[0,T], 

Jo 

then,  4>(<)  and  A  are  ceilled  eigen-matrix-functions  and  eigen-matrix-values  of  the 
operator  K(s,<)-  respectively. 

Notice  that  the  property  ($,$)  =  In  is  required  in  the  above  definitions  of 
eigen-matrix-functions  and  eigen-matrix-values,  which  is  different  from  the  scalar¬ 
valued  case.  In  the  scalar-valued  case,  if  4>(t)  is  an  eigenfunction  associated  with  an 
eigenvalue  A  for  a  scalar  Fredholm  integral  operator,  then  a<f>{t)  for  any  constant 
a  7^  0  is  also  an  eigenfunction  associated  with  A.  It  is  not  known,  however,  whether 
the  following  statement  is  true:  If  *(t)  is  an  eigen-matrix-function  associated  with 
an  eigen-matrix-value  A  for  a  matrix  Fredholm  integral  operator  K(s,  t),  then  A9(t) 
or  *(t)  A  for  an  N  x  N  matrix  A  €  CN  *  N  is  also  an  eigen-matrix-function  associated 
with  A  for  the  operator  K(s,t).  The  difficulty  is  due  to  the  noncommutativity  of 
matrix  multiplications. 
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A  matrix  Fredholm  integral  operator  K(s,£)  is  called  Hermitian  if  K (s,t)  = 
K* (t,  s)  for  s,  t  G  [0,  T\.  If  K(s,  t)  is  Hermitian  and  A  is  its  eigen-matrix- value,  then 
A  =  A*,  i.e.,  A  is  also  Hermitian.  This  is  because 

($,K^)=A  =  (($,K^))t=At. 

We  associate  each  matrix  Fredholm  integral  operator  K(s,£)  on  [0 ,T]  x  [0 ,T] 
with  the  following  scalar  Fredholm  integral  operator  K(s,t)  on  [0,  NT]  x  [0,  ATT]: 


(5.8) 


K(s,  t)  ~  Kkti(s  —  (k  -  1  )T,  t  -  (l  -  1  )T), 


if  ( $,t )  G  (( k  —  1)T,  kT]  x  ((/  —  1)T,  IT],  k,  l  =  1,2,  where  Kk,i(s,  t)  is  the 
component  function  of  K(s,£)  at  the  fcth  row  and  the  Zth  column.  The  property 
(5.6)  implies  the  following  properties  for  K(s,t): 


NT  rNT 


(5.9) 


si 


| K(s,  t)\2dtds  <  oo, 


and  if  K (s,t)  is  Hermitian  then  K(s,t)  is  also  Hermitian,  i.e.,  K(s,  t)  —  K*(t,  s), 
where  *  means  the  complex  conjugate. 

We  now  have  the  following  generalized  Hilbert-Schmidt  theorem. 

THEOREM  5.1.  Let  K (s,t),  s,t  G  [0, T],  be  a  Hermitian  matrix  Fredholm  in¬ 
tegral  operator  and  K{s\  t),  s,t  G  [0,  AT],  be  its  associated  scalar  Fredholm  inte¬ 
gral  operator .  Let  Ai,A2,...,  all  be  eigenvalues  (including  multiples)  of  K{s,t)  with 
| Ax |  >  | A2 1  >  •**.  Then,  an  N  x  N  matrix  A  is  an  eigen-matrix- value  of  the 
operator  K(s,£)  if  and  only  if 


(5.10) 


A  =  Udiag{Xni ,  •  •  •  ,\nN)U\ 


where  U  is  a  certain  N  x  TV  unitary  matrix,  and  n\, ...,  nyv  are  positive  integers  with 
Mi  <  ri2  <  •  •  *  <  nyv.  Moreover,  if  the  operator  K(s,t)  doesn’t  have  zero  eigenvalue, 
i.e.,  |An|  >  0,  n  =  1,2,...,  then,  the  eigen-matrix-functions  &n{t)  corresponding 

to  the  eigen-matrix-values  An  =  diag( \n~\)N+i,--  ,  A  nN),  n  =  1,2,...,  form  an 
orthonormal  basis  for  the  matrix  valued  signal  space  L2(0,T\CNxN). 

Proof:  From  the  definition  of  an  eigen-matrix- value  in  (5.7),  W  AU  is  an  eigen- 
matrix- value  of  K(s,Z)  if  A  is  an  eigen-matrix- value  of  K(s,t)  and  U  is  an  N  x  N 
unitary  matrix.  Thus,  to  prove  A  in  (5.10)  is  an  eigen-matrix-value  of  K(s,£), 
we  only  need  to  prove  the  diagonal  matrix  diag(AUl ,  •  •  •  ,  AnN )  is  an  eigen-matrix- 
value  of  K(s,  t).  In  fact,  without  loss  of  generality,  we  only  need  to  prove  An  is  an 
eigen- matrix- value  of  K(s,  t)  for  any  integer  n  >  1. 

Let  <t>n(t ),  t  G  [0,AT],  be  the  eigenfunctions  of  K(s,t)  corresponding  to  An, 
n  =  1,2 . i.e.,  <t>n{t ),  n  =  1,2,...,  form  an  orthonormal  set  of  L2(0,  NT;C),  and 


(5.11) 


[NT 

Jo  ‘ 


<pn(s)K(s,t)ds  =  te[0,NT}. 


Then,  equation  (5.11)  can  be  rewritten  as 
rT  N- 1 


rl 

/  V  Ms  +  kT)K(s  +  kT,  t)ds  =  A n0n(t),  t  €  [0,  ATT]. 

Jo  k=0 


(5.12) 
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Let  <j>ktn(s)  =  <f>n{s  +  kT),  a  G  [0,71,  k  =  0, 1,  ...,N  -  1.  Then, 


pT  N- 1 

/  X/  ^fc’n 

Jo  k= 0 


(5.13) 


(s)AT(s  +  fcT,  f)ds  =  A„<ft,„(f  -  IT), 


for  t  G  (IT,  (l  +  1)7],  f  =  0, 1, ...,  N  —  1.  Let 


(5.14)  *„(*)  = 


/  </»0,(n-l)N+l(s)  0O,(n-l)N+2(s) 

01,(n-l)N+l(s)  ^l,(„-l)JV+2(«) 


4>0,nN(s)  \ 

01,nAr(s) 


\  <^AT-l,(n-l)N+l(s)  <f>N-l,(n-l)N+2(s)  <£jV-l,nJv(s)  J 

By  (5.8),  (5.13)  can  be  rewritten  as 

(5.15)  [  $„(s)K(s,f)dt  =  An$„(f),  n=l,2,...,  <6  [0,T]. 

Jo 

By  the  orthonormality  of  6u(s).  t  €  [0,  NT],  it  is  not  hard  to  see  that 


(5.16)  ($m,$n>  =  6{m-n)IN,  m,n  =  1,2,.... 

Therefore,  we  have  proved  that  A„,  n  =  1,2,...,  are  eigen-matrix-values  of  the 
operator  K(s,  t ). 

Conversely,  let  A  be  an  eigen-matrix-value  of  the  operator  K(s,t).  By  the 
previous  discussion  we  know  that  A  is  Hermitian.  Thus,  there  exists  a  unitary 
matrix  U  such  that  A  =  £/diag(ai,  •  •  •  ,qcn)U*  with  |qi  !>•••>  o.vl-  By  definition 

(5.7)  of  an  eigen-matrix- value,  diag(ai,---  ,aN)  is  also  an  eigen-matrix- value  of 
K(s,  t),  i.e.,  there  is  $(f)  €  L2(0,T;  CNxN)  with  ($,  $)  =  IN  such  that 

(5.17)  f  <f>(s)K(s,t)ds  =  diag(ai,--- ,aN)$(t),  te[0,T]. 

Jo 

Assume  </>m,„(s)  is  the  mth  row  and  the  nth  column  component  function  of  $(s). 
Let  <j>n(s)  =--  $m,n(s  -  (m  -  1)T)  if  s  G  ((m  -  l)T,mT]  for  m,n  =  1,2,  ...,N.  By 

(5.8)  and  (5.17),  the  function  <pn{s )  is  an  eigenfunction  of  the  operator  K(s,  t)  with 
its  corresponding  eigenvalue  an,  n  =  1,2 Thus,  a*  =  A„fc  for  some  k  with 
Ti]  <  ri2  <  ■  ■  ■  <  tin.  This  proves  (5.10). 

When  K(s,t)  has  no  zero  eigenvalue,  by  the  scalar  Hilbert-Schmidt  Theorem 
(see  [25]),  the  eigenfunctions  <pn(t),  n  =  1,2,...,  form  an  orthonormal  basis  for 
I2(0.  NT;CNxN).  Therefore,  any  /(f)  €  L2(0,NT;C)  can  be  represented  as 

OC 

(5.18)  =  t  €  [O.ATT]. 

n  =  l 


Similarly,  (5.18)  can  be  rewritten  as 


f(t)  =  y;  f  f{s){<t>o.n(s),-  ,0,V-I.n(s))tds(</>O,n(<),- •  •  ,<t>N-l,n{t)),  tS[0,T], 

n  =  lJ° 


for  any  N  x  1  vector-valued  f  €  L2(0,  T;C'V).  By  regrouping  the  above  summation, 
we  have 


(5.19)  f(f)  =  T  [T  {(s)$l(s)*n(t)ds,  t  G  [0,71,  f€L2(0,r;CN). 

n=lJo 
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Extending  f (t)  £  L2(0,T;CN)  to  f(t)  £  L2(0,T;CNxN),  we  have 

oo 

(5.20)  f (t)  =  £(f, $„}$„(£),  t  G  [0, T],  f(t)  €  i/2(0, T;  CNxN). 

n=l 

This  proves  that  the  sequence  $„(£),  n  =  1,2,...,  forms  an  orthonormal  basis  for 

L2(0,T;CNxN).  ♦ 

From  the  above  proof,  the  eigen-matrix-function  4>n(£)  in  Theorem  5.1  associ¬ 
ated  with  the  eigen-matrix-value  An  in  Theorem  5.1  is  formulated  by  (5.14),  for 
n  =  1,2, ....  We  next  want  to  generalize  Mercer’s  Theorem.  A  matrix  Fredholm 
integral  operator  K(s,  t)  is  called  positive  if  the  N  x  N  matrix  (f,Kf)  for  any 
f(f)  G  L2(0,T;  CNxW)  is  nonnegative  definite,  i.e.,  xt(f,  Kf)x  >  0  for  any  x  G  C^. 

Lemma  5.2.  A  matrix  Fredholm  integral  operator  K(s,t)  is  positive  if  and  only 
if  its  associated  scalar  Fredholm  integral  operator  K(s,t)  is  positive. 

Proof:  Writing  (/,  Kf )  up,  similar  to  the  proof  of  Theorem  5.1,  we  have 
[NT  tNT  rT  rT 

(5.21)  /  /  f*(s)K*(s,t)f(t)dsdt=  /  /  f(f)Kt(s,t)ft(s)d£ds, 

Jo  Jo  Jo  Jo 

where  f(<)  G  L2(0,T;CN).  On  the  other  hand, 

(5.22)  xf  [ T  [T  {(t)^(s,t)fHs)dtdsx=  [ T  [T (x'f(t))K'(s,t)(x.'f(s))Utds, 

Jo  Jo  Jo  Jo 

where  x  £  CN  and  f  (t)  £  L2(0,  T;  CNx  N).  Since 

L2(0,T;Cn)  =  {f (Ox  :  x  £  CN,  f  €  L2(0,  T;  CNxN)}, 

the  values  in  (5.21)  are  nonnegative  for  all  f (t)  £  L2(0,T;  CN)  is  equivalent  to  that 
the  values  in  (5.22)  are  nonnegative  for  all  x  £  CN  and  all  f (t)  £  L2(0,  T;  CNx N). 
This  proves  Lemma  5.2.  4» 

we  have  the  following  generalized  form  of  Mercer’s  Theorem. 

Theorem  5.3.  Let  K(s>t)  be  a  Hermitian  matrix  Fredholm  integral  operator. 
IfK(s,t)  is  positive  and  its  associated  scalar  Fredholm  integral  operator  K(s,t)  is 
continuous  in  [0,  NT]  x  [0,JVT],  then 

oo 

(5-23)  K(s,t)  =  £*n(*)An*n(<),  s,£G  [0,r], 

n=l 

where.  4>r,(f)  and  An  are  the  same  as  in  Theorem  5.1  and  the  convergence  of  the 
infinite  summation  is  uniform. 

Proof:  By  Lemma  5.2,  the  operator  K(s,t)  is  also  positive.  By  Mercer’s 
theorem  for  the  operator  K(s,t)  (see  [25]), 

oc 

K(s,t)  =  '$2<pn(s)<t>n(t)\n,  s,t  €.  [0, NT], 

n=  1 

where  4>n,  An  are  eigenfunctions  and  eigenvalues  of  K(s,t )  and  the  convergence  is 
uniform.  Regrouping  the  above  summation  and  using  the  same  technique  in  the 
proof  of  Theorem  5.1,  (5.23)  can  be  proved.  A 
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5.3.  Matrix  KL  Expansions  for  Continuous-Time  Matrix  Valued  Sig¬ 
nals.  We  now  come  back  to  the  MKL  expansions  for  continuous-time  matrix  valued 

signals.  . 

Let  R (s,t)  be  the  correlation  matrix  function  defined  by  (5.1)  of  a  matrix  val¬ 
ued  random  process  X(t)  with  a  =  0  and  b  =  T.  Assume  R(s,£)  €  L  (0,T,  ^ 

Then  R(s,  t)  is  a  Hermitian  matrix  Fredholm  integral  operator  on  Zr(0,T;C  x  ); 
moreover  R (s,t)  is  positive.  Therefore,  we  can  apply  the  generalized  Hilbert- 
Schmidt  Theorem  and  the  generalized  Mercer’s  Theorem. 

Let  R(s,  t)  be  the  associated  scalar  Fredholm  integral  operator  of  the  operator 
R that  is  defined  by  (5.8).  Let  An,  n  =  1,2,...,  all  be  eigenfunctions 

and  eigenvalues  (including  multiples)  of  the  operator  R(s,t )  with 
[■NT 

(5.24)  /  <t>n{s)R{s,  t)ds  =  A n0„(t),  t  €  [0,  ATT],  n  =  1, 2, .... 

Jo 

and 

[NT 

(5.25)  /  <t>m{t)<t>*n{t)dt  =  6{m-n),  m,n  =  1,2, .... 

Jo 

where  |Ai|  >  IA2I  >  •  •  • .  Since  the  operator  R(s,  t)  is  positive,  by  Lemma  5.2,  the 
operator  R(s,  t )  is  also  positive.  Thus,  Ai  >  A2  >  •  ■  •  >  0. 

Let 

(5.26)  '  An  =  diag(A(n_i)Ar+i,--- ,A„Ar),  n  =  l,2, ..., 

and,  for  t  e  [0,T],  n  =  1,2,...,  and  $n(t)  defined  by  (5.14).  Then,  by  Theorem 
5.1,  its  proof  and  (5.25),  $„(<)  is  an  eigen-matrix-function  of  the  operator  R(s,  t) 
corresponding  to  the  eigen-matrix  value  An  in  (5.26)  for  n  =  1,2, ....  This  gives  the 
following  first  condition  on  signals  so  that  their  MKL  expansions  exist. 

Theorem  5.4.  Let  X(t),  t  €  [0,T],  be  a  random  process  with  its  correlation 
matrix  function  R (s,t)  €  L2(0,T;  CN*N).  If  K  >  0,  n  =  1,2,...,  then,  for  each 
path  ofX(t), 


(5.27) 


oc 

X(t)  =  £(X,  #„)*„(<),  t  e  [0,T], 


i.e.,  the  MKL  expansion  ofX(t )  exists  in  the  sense  (5.2)-(5-4)- 
The  second  condition  is  given  by  the  following  theorem. 

Theorem  5.5.  Let  X(t),  t  €  [0,T],  be  a  random  process  with  its  correlation 
matrix  function  R(s,  t)  £  L2(0,T;  CNxN).  If  its  associated  scalar  Fredholm  integral 
operator  R(s,  t)  is  continuous  in  [0,  NT)  x  (0,  NT],  then  the  MKL  expansion  ofX(t) 
exists: 


(5.28) 


CC 

x(t)  =  £(x,$„)$„(t),  te[0,T], 


where  the  convergence  is  in  the  mean  square  sense. 


The  proofs  of  the  above  two  theorems  are  straightforward  by  using  the  results 
in  Section  5.2. 

From  Theorems  5.4-5.5,  it  seems  that  the  MKL  expansions  of  X(t)  depend  on 
the  definition  of  the  associated  scalar  Fredholm  integral  operator  R{s,t)  of  R(s,t). 
One  might  ask,  when  the  existence  of  the  MKL  expansion  of  X(t)  in  the  sense 
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of  (5.2)- (5.4)  is  assumed,  whether  the  MKL  expansion  of  X(£)  changes  if  the  way 
to  define  R(s,t)  in  (5.8)  changes.  The  answer  is  NO.  In  other  words,  the  MKL 
expansions  (5.27)  and  (5.28)  in  Theorems  5.4-5. 5  are  necessary. 

Theorem  5.6.  Let  X(£),  t  e  [0,T],  be  a  random  process  with  its  correlation 
matrix  function  R (s,£)  €  L2(0,  T;  CNxN).  If  the  MKL  expansion  ofX(t)  exists  in 
the  sense  of  (5.2)-(5.4),  then  the  MKL  expansion  ofX(t)  can  always  be  written  as 

oo 

(5.29)  X(t)  =  £(X,*n)*n(t),  t  €  [0,71, 

*  n=l 

where  $n(£),  n  =  1,2...,  are  defined  in  (5.14). 

Proof:  By  (5.2)-(5.4),  there  exist  <(t)  €  L2(0,T;  CNxN)  and  €  CNxN , 
n  =  1,2, ...,  such  that 
T 

$'n(s)R{s,t)ds  =  An<f>'n(t),  n  =  1,2,...,  t  <E  [0, jT], 

(K^'m)  =(5(n-m)/iv,  m,n=  1,2,..., 
and 

oo 

(5.30)  X(*)  =  £(X,<><«,  *€[0,11. 

n=l 

Thus,  4>n(t)  is  an  eigen-matrix-function  of  the  operator  R(s,  t)  corresponding  to  the 
eigen-matrix- value  An  for  n  =  1, 2, ... .  By  Theorem  5.1,  there  exist  unitary  matrices 
Un  such  that  An  =  U^A^Un  for  n  =  1, 2, ...,  where  the  order  of  the  eigenvalues  An 
is  rearranged  if  necessary.  Moreover,  An  is  an  eigen-matrix-value  of  R(s,£)  with  its 
eigen-matrix-function  Un^n{t),  n  —  1,2, ...  .  Then,  similar  to  the  proof  of  Theorem 
5.1,  one  can  show  that  $n(t)  =  Un$'n(t),  n  =  1,2,...  .  By  (5.30), 

oo  oo 

X(0  =  ^(XrC/t$n)C/t$n(<)  =  53<x,*n)*n(t). 

n=l  n=l 

This  proves  (5.29).  4 

From  Thoerems  5. 1-5.6,  one  can  clearly  see  that  a  matrix  valued  random  process 
X(/)  is  completely  decorrelated  in  the  both  time  and  the  spatial  domains  using 
Orthogonality  B. 

6.  Conclusion 

In  this  paper,  we  studied  orthonormal  matrix  valued  multiresolution  analysis 
and  wavelets.  A  simple  sufficient  condition  on  the  matrix  filter  H(a>)  that  leads 
to  orthonormal  matrix  valued  wavelets  is  presented,  which  is  analogous  to  the 
one  given  by  Mallat  in  [18]  for  scalar  valued  wavelets.  This  sufficient  condition 
enables  us  to  construct  families  of  nontrivial  orthonormal  matrix  valued  wavelets. 
With  orthonormal  matrix  valued  wavelets,  one  is  able  to  construct  multiwavelets 
with  a  different  orthonormality  (called  Orthogonality  B  in  this  paper)  from  the 
one  people  currently  use  (called  Orthogonality  A  in  this  paper).  It  was  shown 
that  Orthogonality  B  is  weaker  than  Orthogonality  A.  We  believe  that  this  weaker 
orthogonality  makes  the  sufficient  condition  simple.  The  main  idea  behind  it  is  that 
one  dimensional  vectors  are  lifted  to  two  dimensional  matrices,  and  therefore  more 
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freedoms  are  available.  It  was  also  shown  that  Orthogonality  B  provides  a  complete 
Karhunen-Loeve  expansion,  i.e.,  a  complete  decorrelation,  for  matrix  valued  signals. 
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Abstract:  In  this  paper,  we  introduce  a  new  multiresolution  water¬ 
marking  method  for  digital  images.  The  method  is  based  on  the  dis¬ 
crete  wavelet  transform  (DWT).  Pseudo-random  codes  are  added  to  the 
large  coefficients  at  the  high  and  middle  frequency  bands  of  the  DWT 
of  an  image.  It  is  shown  that  this  method  is  more  robust  to  proposed 
methods  to  some  common  image  distortions,  such  as  the  wavelet  trans¬ 
form  based  image  compression,  image  rescaling/stretching  and  image 
halftoning.  Moreover,  the  method  is  hierarchical. 
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1.  Introduction 

With  the  rapid  development  of  the  current  information  technology,  electronic  publishing, 
such  as  the  distribution  of  digitized  images/videos,  is  becoming  more  and  more  popular. 
An  important  issue  for  electronic  publishing  is  copyright  protection.  Watermarking  is 
one  of  the  current  copyright  protection  methods  that  have  recently  received  considerable 
attention.  See,  for  example,  [1-8,  18],  Basically,  “invisible”  watermarking  for  digital 
images  consists  of  signing  an  image  with  a  signature  or  copyright  message  such  that 
the  message  is  secretly  embedded  in  the  image  and  there  is  negligible  visible  difference 

between  the  original  and  the  signed  images.  ... 

There  are  two  common  methods  of  watermarking:  the  frequency  domain  and 
the  spatial  domain  watermarks,  for  example  [1-8,  18].  In  this  paper,  we  focus  on  fre¬ 
quency  domain  watermarks.  Recent  frequency  domain  watermarking  methods  are  based 
on  the  discrete  cosine  transform  (DCT),  where  pseudo-random  sequences,  such  as  M- 
sequences,  are  added  to  the  DCT  coefficients  at  the  middle  frequencies  as  signatures 
[2-31.  This  approach,  of  course,  matches  the  current  image/video  compression  standards 
well,  such  as  JPEG,  MPEG1-2,  etc.  It  is  likely  that  the  wavelet  image/video  coding, 
such  as  embedded  zero-tree  wavelet  (EZW)  coding,  will  be  included  in  the  up-coming 
image/video  compression  standards,  such  as  JPEG2000  and  MPEG4.  Therefore,  it  is 
important  to  study  watermarking  methods  in  the  wavelet  transform  domain. 

In  this  paper,  we  propose  a  wavelet  transform  based  watermarking  method  by 
adding  pseudo-random  codes  to  the  large  coefficients  at  the  high  and  middle  frequency 
bands  of  the  discrete  wavelet  transform  of  an  image.  The  basic  idea  is  the  same  as  the 
spread  spectrum  watermarking  idea  proposed  by  Cox  et.  al.  in  [2].  There  are,  however, 
three  advantages  to  the  approach  in  the  wavelet  transform  domain.  The  first  advantage 
is  that  the  watermarking  method  has  multiresolution  characteristics  and  is  hierarchical. 
In  the  case  when  the  received  image  is  not  distorted  significantly,  the  cross  correlations 
with  the  whole  size  of  the  image  may  not  be  necessary,  and  therefore  much  of  t  e 
computational  load  can  be  saved.  The  second  advantage  lies  in  the  following  argument. 
It  is  usually  true  that  the  human  eyes  are  not  sensitive  to  the  small  changes  in  edges  and 
textures  of  an  image  but  are  very  sensitive  to  the  small  changes  in  the  smooth  parts  of 
an  image  With  the  DWT,  the  edges  and  textures  are  usually  well  confined  to  the  high 
frequency  subands,  such  as  HH,  LH,  HL  etc.  Large  coefficients  in  these  bands  usually 
indicate  edges  in  an  image.  Therefore,  adding  watermarks  to  these  large  coefficients 
is  difficult  for  the  human  eyes  to  perceive.  The  third  advantage  is  that  this  approach 
matches  the  emerging  image/video  compression  standards.  Our  numerical  results  show 
that  the  watermarking  method  we  propose  is  very  robust  to  wavelet  transform  based 
image  compressions,  such  as  the  embedded  zero-tree  wavelet  (EZW)  image  compression 
scheme,  and  as  well  as  to  other  common  image  distortions,  such  as  additive  noise, 
rescaling/stretching,  and  halftoning.  The  intuitive  reason  for  the  advantage  of  the  DWT 
approach  over  the  DCT  approach  in  rescaling  is  as  follows.  The  DCT  coefficients  for  the 
rescaled  image  are  shifted  in  two  directions  from  the  ones  for  the  original  image,  which 
degrades  the  correlation  detection  for  the  watermark.  Since  the  DWT  are  localized 
not  only  in  the  time  but  also  in  the  frequency  domain  [9-15],  the  degradation  for  the 
correlation  detection  in  the  DWT  domain  is  not  as  serious  as  the  one  in  the  DCT 

domain.  ,  . 

Another  difference  in  this  paper  with  the  approach  proposed  by  Cox  et.  al.  in 

[2]  is  the  watermark  detection  using  the  correlation  measure.  The  watermark  detection 
method  in  [2]  is  to  take  the  inner  product  (the  correlation  at  the  r  =  0  offset)  of  the 
watermark  and  the  difference  in  the  DCT  domain  of  the  watermarked  image  and  the 
original  image.  Even  though  both  the  difference  and  the  watermark  are  normalized,  the 
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inner  product  may  be  small  if  the  difference  significantly  differs  from  the  watermark 
although  there  may  be  a  watermark  in  the  image.  In  this  case,  it  may  fail  to  detect 
the  watermark.  In  this  paper,  we  propose  to  take  the  correlation  at  all  offsets  r  of 
the  watermark  and  the  difference  in  the  DWT  domain  the  watermarked  image  and 
the  original  image  in  different  resolutions.  The  advantage  of  this  new  approach  is  that, 
although  the  peak  correlation  value  may  not  be  large,  it  is  much  larger  than  all  other 
correlation  values  at  other  offsets  if  there  is  a  watermark  in  the  image.  This  ensures 
the  detection  of  the  watermark  even  though  there  is  a  significant  distortion  in  the 
watermarked  image.  The  correlation  detection  method  in  this  paper  is  a  relative  measure 
rather  than  an  absolute  measure  as  in  [2]. 

This  paper  is  organized  as  follows.  In  Section  2,  we  briefly  review  some  basics 
on  discrete  wavelet  transforms  (DWT).  In  Section  3,  we  propose  our  new  watermarking 
method  based  on  the  DWT.  In  Section  4,  we  implement  some  numerical  experiments  in 
terms  of  several  different  image  distortions,  such  as,  additive  noise,  rescaling/stretching, 
image  compression  with  EZW  coding  and  halftoning. 

2.  Discrete  Wavelet  Transform  (DWT):  A  Brief  Review 

The  wavelet  transform  has  been  extensively  studied  in  the  last  decade,  see  for  example  [9- 
16].  Many  applications,  such  as  compression,  detection,  and  communications,  of  wavelet 
transforms  have  been  found.  There  are  many  excellent  tutorial  books  and  papers  on 
these  topics.  Here,  we  introduce  the  necessary  concepts  of  the  DWT  for  the  purposes  of 
this  paper.  For  more  details,  see  [9-15]. 

The  basic  idea  in  the  DWT  for  a  one  dimensional  signal  is  the  following.  A 
signal  is  split  into  two  parts,  usually  high  frequencies  and  low  frequencies.  The  edge 
components  of  the  signal  are  largely  confined  to  the  high  frequency  part.  The  low  fre¬ 
quency  part  is  split  again  into  two  parts  of  high  and  low  frequencies.  This  process  is 
continued  an  arbitrary  number  of  times,  which  is  usually  determined  by  the  application 
at  hand.  Furthermore,  from  these  DWT  coefficients,  the  original  signal  can  be  recon¬ 
structed.  This  reconstruction  process  is  called  the  inverse  DWT  (IDWT).  The  DWT 
and  IDWT  can  be  mathematically  stated  as  follows. 

Let 

H(w)  =  Y,  hke-jku/,  and  G(u>)  =  Y^e^. 
k  k 

bo  a  lowpass  and  a  highpass  filter,  respectively,  which  satisfy  a  certain  condition  for 
reconstruction  to  be  stated  later.  A  signal,  x[n]  can  be  decomposed  recursively  as 


— lffc  —  ^  ^  hn-2kCj,n 

n 

(i) 

dj—lfk  =  ^  ]  9n-2kCj,n 

(2) 

n 


for  j  =  J  + 1,  J, ...,  Jo  where  cj+i,*  =  x\k),  k  G  Z,  J  + 1  is  the  high  resolution  level  index, 
and  Jo  is  the  low  resolution  level  index.  The  coefficients  c/0tfc,  dj0,fc,  djr0+i,*:, ...,  dj^ are 
called  the  DWT  of  signal  x[n],  where  is  the  lowest  resolution  part  of  x[n ]  and  dj^ 
are  the  details  of  x[n]  at  various  bands  of  frequencies.  Furthermore,  the  signal  x[n]  can 
be  reconstructed  from  its  DWT  coefficients  recursively 

Cj,n  —  h,n  —  2kCj-l,k  H"  Qn- 2kdj  —  l,fc*  (3) 

k  k 
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reconstruction 


Figure  1.  DWT  for  one  dimensional  signals. 


n  n 


Figure  2.  DWT  for  two  dimensional  images. 

The  above  reconstruction  is  called  the  IDWT  of  x[n).  To  ensure  the  above 
IDWT  and  DWT  relationship,  the  following  orthogonality  condition  on  the  filters  H{u)) 
and  G(uj)  is  needed: 

Ml2  +  |GM|2  =  i. 

An  example  of  such  H(u)  and  G(u>)  is  given  by 

%)  =  \  +  and  G(v)  =  \- 
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which  are  known  as  the  Haar  wavelet  filters. 

The  above  DWT  and  IDWT  for  a  one  dimensional  signal  x[n]  can  be  also  de¬ 
scribed  in  the  form  of  two  channel  tree-structured  filterbanks  as  shown  in  Fig.  1.  The 
DWT  and  IDWT  for  two  dimensional  images  x[m,n]  can  be  similarly  defined  by  imple¬ 
menting  the  one  dimensional  DWT  and  IDWT  for  each  dimension  m  and  n  separately: 
DWTn[DWTm[x[m, n]]\,  which  is  shown  in  Fig.  2.  An  image  can  be  decomposed  into 
a  pyramid  structure,  shown  in  Fig.  3,  with  various  band  information:  such  as  low-low 
frequency  band,  low-high  frequency  band,  high-high  frequency  band  etc.  An  example 
of  such  decomposition  with  two  levels  is  shown  in  Fig.  4,  where  the  edges  appear  in  all 
bands  except  in  the  lowest  frequency  band,  i.e.,  the  corner  part  at  the  left  and  top. 


ll3 

HL  3 

HL, 

lh3 

HH  3 

HL^ 

lh2 

HH  2 

LHj 

HHj 

Figure  3.  DWT  pyramid  decomposition  of  an  image. 


Figure  4.  Example  of  a  DWT  pyramid  decomposition. 

3.  Watermarking  in  the  DWT  Domain 

Watermarking  in  the  DWT  domain  is  composed  of  two  parts:  encoding  and  decoding. 
In  the  encoding  part,  we  first  decompose  an  image  into  several  bands  with  a  pyramid 
structure  as  shown  in  Figs.  3-4  and  then  add  a  pseudo-random  sequence  (Gaussian 
noise)  to  the  largest  coefficients  which  are  not  located  in  the  lowest  resolution,  i.e.,  the 
corner  at  the  left  and  top,  as  follows.  Let  y[m,n]  denote  the  DWT  coefficients,  which 
are  not  located  at  the  lowest  frequency  band,  of  an  image  x[n,m],  We  add  a  Gaussian 
noise  sequence  iV[m,n]  with  mean  0  and  variance  1  to  y\m,  nj: 

j/[m,n]  =  y[m,  n]  +  ay2[m,n]N[m,n],  (4) 

where  a  is  a  parameter  to  control  the  level  of  the  watermark,  the  square  indicates  the 
amplification  of  the  large  DWT  coeffcients.  We  do  not  change  the  DWT  coefficients  at 
the  lowest  resolution.  Then,  we  take  the  two  dimensional  IDWT  of  the  modified  DWT 
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coefficients  y  and  the  unchanged  DWT  coefficients  at  the  lowest  resolution.  Let  x\m,  n] 
denote  the  IDWT  coefficients.  For  the  resultant  image  to  have  the  same  dynamic  range 
as  the  original  image,  it  is  modified  as 

x[m,n]  =  min(max(x[m,n]),max{x[m,n],min(x[m,  n])}).  (5) 

The  operation  in  (5)  is  to  make  the  two  dimensional  data  x[m,n]  be  the  same  dynamic 
range  as  the  original  image  x[m,  n].  The  resultant  image  x[m,n]  is  the  watermarked 
image  of  x[m,n].  The  encoding  part  is  illustrated  in  Fig.  5(a). 


Original 

image 


Insert 

watermarks 


Gaussian 


noise 


Inverse 

DWT 


Watermarked 

image 


(a) :  Encoding 


Watermarked 


(b) :  Decoding 


Figure  5.  Watermarking  in  the  DWT  domain. 


The  decoding  method  we  propose  is  hierarchical  and  described  as  follows.  We 
first  decompose  a  received  image  and  the  original  image  (it  is  assumed  that  the  original 
image  is  known)  with  the  DWT  into  four  bands,  i.e.,  low-low  (LLi)  band,  low-high 
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(LH i)  band,  high-low  {HL\)  band,  and  high-high  ( HH\ )  band,  respectively.  We  then 
compare  the  signature  added  in  the  HH\  band  and  the  difference  of  the  DWT  coeffi¬ 
cients  in  HH\  bands  of  the  received  and  the  original  images  by  calculating  their  cross 
correlations.  If  there  is  a  peak  in  the  cross  correlations,  the  signature  is  called  detected. 
Otherwise,  compare  the  signature  added  in  the  HH\  and  LH\  bands  with  the  difference 
of  the  DWT  coefficients  in  the  HH\  and  LH\  bands,  respectively.  If  there  is  a  peak,  the 
signature  is  detected.  Otherwise,  we  consider  the  signature  added  in  the  HL\ ,  LHu  and 
HH\  bands.  If  there  is  still  no  peak  in  the  cross  correlations,  we  continue  to  decompose 
the  original  and  the  received  signals  in  the  LL\  band  into  four  additional  subbands  LL2, 
LH2 ,  HL2  and  HH2  and  so  on  until  a  peak  appears  in  the  cross  correlations.  Otherwise, 
the  signature  can  not  be  detected.  The  decoding  method  is  illustrated  in  Fig.  5(b). 

4.  Numerical  Examples 

We  implement  two  watermarking  methods:  one  is  using  the  DOT  approach  proposed 
by  Cox  el.  al.  in  [2]  and  the  other  is  using  the  DWT  approach  proposed  in  this  paper. 
In  the  DWT  approach,  the  Haar  DWT  is  used.  Two  step  DWT  is  implemented  and 
images  are  decomposed  into  7  subbands.  Watermarks,  Gaussian  noise,  are  added  into 
all  6  subbands  but  not  in  the  lowest  subband  (the  lowest  frequency  components).  In 
the  DCT  approach,  watermarks  (Gaussian  noise)  are  added  to  all  the  DCT  coefficients. 
The  levels  of  watermarks  in  the  DWT  and  DCT  approaches  are  the  same,  i.e.,  the  total 
energies  of  the  watermark  values  in  these  two  approaches  are  the  same.  It  should  be 
'  noted  that  we  have  also  implemented  the  DCT  watermarking  method  when  the  pseudo¬ 
random  sequence  is  added  to  the  DCT  values  at  the  same  positions  as  the  ones  in  the 
above  DWT  approach,  i.e.,  the  middle  frequencies.  We  found  that  the  performance  is 
not  as  good  as  the  one  by  adding  watermarks  in  all  the  frequencies  in  the  DCT  domain. 

Two  images  with  size  512  x  512,  “peppers”  and  “car,”  are  tested.  Fig.  6(a) 
shows  the  original  “peppers”  image.  Fig.  6(b)  shows  the  watermarked  image  with  the 
DWT  approach  and  Fig.  7(a)  shows  the  watermarked  image  with  the  DCT  approach. 
Both  watermarked  images  are  indistinguishable  from  the  original.  A  similar  property 
holds  for  the  second  test  image  “car,”  whose  original  image  is  shown  in  Fig.  8(b). 

The  first  distortion  against  which  we  test  our  algorithm  with  is  additive  noise. 
Two  noisy  images  are  shown  in  Fig.  7(b)  and  Fig.  8(a),  respectively.  When  the  variance 
of  the  additive  noise  is  not  too  large,  such  as  the  one  shown  in  Fig.  7(b),  the  signature 
can  be  detected  only  using  the  information  in  the  HH\  band  with  the  DWT  approach, 
where  the  cross  correlations  are  shown  in  Fig.  9(a)  and  a  peak  can  be  clearly  seen. 
When  the  variance  of  the  additive  noise  is  large,  such  as  the  one  shown  in  Fig.  8(a), 
the  HH\  band  information  is  not  good  enough  with  the  DWT  approach,  where  the 
cross  correlations  are  shown  in  Fig.  9(b)  and  no  clear  peak  can  be  seen.  However,  the 
signature  can  be  detected  by  using  the  information  in  the  HH\  and  LH\  bands  with 
the  DWT  approach,  where  the  cross  correlations  are  shown  in  Fig.  9(d)  and  a  peak 
can  be  clearly  seen.  For  the  second  noisy  image,  we  have  also  implemented  the  DCT 
approach.  In  this  case,  the  signature  with  the  DCT  approach  can  not  be  detected,  where 
the  correlations  are  shown  in  Fig.  9(c)  and  no  clear  peak  can  be  seen.  Similar  results 
hold  for  the  “car”  image  and  the  correlations  are  shown  in  Fig.  10. 

The  second  “test”  distortion  is  rescaling/stretching  for  “peppers”  and  “car” 
images,  three  types  of  rescaling/stretchings  are  implemented.  In  the  first  two  imple¬ 
mentations,  the  rescaled/stretched  images  are  rescaled  back  to  the  same  size  of  the 
original  image  using  interpolations,  where  25%  reduction/enlargement  is  used.  In  the 
third  implementation,  the  stretched  images  are  simply  cut  back  to  the  original  size, 
where  1%  and  2%  stretching  is  used. 

In  the  rescaling ,  an  image,  x,  is  reduced  to  3/4  of  the  original  size.  The  method  of 
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the  rescaling  is  from  the  MATLAB  function  called  “imresize.”  as  imresize  (x ,  1-1/4 , 
Method’)  where  ’method’  indicates  one  of  the  methods  in  the  interpolations  between 
pixels:  piecewise  constant,  bilinear  spline,  and  cubic  spline.  With  the  received  smaller 
size  image,  for  the  watermark  detection  we  extend  it  to  the  normal  size,  i.e.,  512  x  512, 
by  using  the  same  Matlab  function  “imresize”  as  imresize (y,  1+1/3,  Method*), 
where  ’method’  is  also  one  of  the  above  interpolation  methods.  In  this  experiment,  we 
implemented  two  different  interpolation  methods  in  imresize  in  the  rescaling  distor¬ 
tion:  the  piecewise  constant  method  and  the  cubic  spline  method.  In  the  detection,  we 
alway  use  the  cubic  spline  as  imresize(y,  1+1/3,  ’bicubic’).  Similar  results  also 
hold  for  other  combinations  of  these  interpolation  methods.  Fig.  11  illustrate  the  de¬ 
tection  results  for  the  “peppers”  image:  Fig.  11(a), (c)  show  the  cross  correlations  with 
the  DWT  approach  while  Fig.  ll(b),(d)  show  the  cross  correlations  with  the  DCT  ap¬ 
proach.  In  Fig.  11(a),  (b),  the  rescaling  method  is  imresize  (x,  1-1/4,  ’nearest ’),  i.e., 
the  piecewise  constant  interpolation  is  used.  In  Fig.  11(c), (d),  the  rescaling  method  is 
imresize (x,  1-1/4 ,  ’bicubic ’),  i.e.,  the  cubic  spline  interpolation  is  used.  One  can  see 
the  better  performance  of  the  DWT  approach  over  the  DCT  approach.  Similar  results 
hold  for  the  “car”  image  and  are  shown  in  Fig.  12. 

When,  in  the  above  rescaling  experiment,  the  size  of  an  image  is  first  reduced 
and  then  extended  in  the  detection,  in  the  stretching,  an  image  is  first  extended  and 
then  reduced  in  the  detection.  The  same  Matlab  function  imresize  as  in  the  rescaling 
is  used.  In  the  stretching  experiment,  an  image  is  extended  by  1/4  of  the  original  size, 
i.e.,  the  MATLAB  function  imresize(x,  1+1/4,  ’method’),  is  used,  where  ’method’ 
is  the  same  as  in  the  rescaling.  In  the  detection,  the  received  image  is  reduced  by  1/5 
to  the  original  size,  i.e.,  the  Matlab  function  imresize (y,  1-1/5,  ’method’)  is  used. 
The  rest  is  similar  to  the  one  in  the  rescaling.  Figs.  13  and  14  show  the  correlation 
properties  for  the  “peppers”  and  the  “car”  images,  respectively. 

In  the  third  implementation  of  rescaling/stretching,  an  image  is  first  stretched 
by  1%  and  2%  using  the  MATLAB  function  imresize (y,  1+1/100,  ’method’)  and 
imresize (y,  1+2/100,  ’method’),  respectively.  The  stretched  image  is  then  cut  back 
to  the  original  size.  Two  images  “peppers”  and  “car”  are  tested.  Figs.  15-16  shows  the 
correlation  properties  for  the  “peppers”  and  the  “car”  images,  respectively,  where  (a) 
and  (b)  are  for  the  1%  stretching,  and  (c)  and  (d)  are  for  the  2%  stretching. 

The  third  “test”  distortion  is  image  compression.  Two  watermarked  images  with 
the  DWT  and  DCT  approaches  shown  in  Fig.  6(b)  and  Fig.  7(a)  are  compressed  by  us¬ 
ing  the  EZW  coding  algorithm.  The  compression  ratio  is  chosen  as  64,  i.e.,  0.1256pp. 
With  these  two  compressed  images,  the  correlations  are  shown  in  Fig.  17  (a)  and  (b), 
where  a  peak  in  the  middle  can  be  clearly  seen  in  Fig.  17(a)  with  the  DWT  approach, 
but  no  clear  peaks  can  be  seen  in  Fig.  17(b)  with  the  DCT  approach.  This  is  not  very 
surprising  because  the  compression  scheme  is  not  suitable  for  the  DCT  approach.  It 
should  be  noticed  that  the  wavelet  filters  in  the  EZW  compression  are  the  commonly 
used  Daubechies  “9/7”  biorthogonal  wavelet  filters  while  the  wavelet  filters  in  the  wa¬ 
termarking  are  the  simpliest  Haar  wavelet  filters  mentioned  in  Section  2. 

The  last  “test”  distortion  is  halftoning.  The  two  watermarked  images  in  Fig. 
6(b)  and  Fig.  7(a)  are  both  halftoned  by  using  the  following  standard  method.  Let 
x[m,n]  be  an  image  with  8  bit  levels.  To  halftone  it,  we  do  the  nonuniform  thresholding 
through  the  Bayer’s  dither  matrix  T  [17]: 

/ 

T  =  (Titfc)4x4  -  16 
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in  the  following  way*  Compare  each  disjoint  4x4  blocks  in  the  image  x[m,  n].  If  x[m  * 
4  +  j,n  *  4  +  k]  >  Tjy ki  then  it  is  quantized  to  1,  and  otherwise  it  is  quantized  to  0. 
Both  DWT  and  DCT  watermarking  methods  are  tested.  Surprisingly,  we  found  that  the 
watermarking  method  based  on  DWT  we  proposed  in  this  paper  is  more  robust  than 
the  method  based  on  the  DCT  in  [2-3].  The  correlations  are  shown  in  Fig.  18(a)  and 
(b),  where  (a)  corresponds  to  the  DWT  approach  while  (b)  corresponds  to  the  DCT 
approach.  One  can  clearly  see  a  peak  in  the  middle  in  Fig.  18(a)  while  no  any  clear  peak 
in  the  middle  can  be  seen  in  Fig.  18(b).  In  this  experiment,  the  watermark  was  added 
to  the  middle  frequencies  in  the  DCT  approach  and  no  inverse  halftoning  was  used. 

5.  Conclusion 

In  this  paper,  we  have  introduced  a  new  multiresolution  watermarking  method  using  the 
discrete  wavelet  transform  (DWT).  In  this  method,  Gaussian  random  noise  is  added  to 
the  large  coefficients  but  not  in  the  lowest  subband  in  the  DWT  domain.  The  decoding  is 
hierarchical.  If  distortion  of  a  watermarked  image  is  not  serious,  only  a  few  bands  worth 
of  information  are  needed  to  detect  the  signature  and  therefore  computational  load  can 
be  saved.  We  have  also  implemented  numerical  examples  for  several  kinds  of  distortions, 
such  as  additive  noise,  rescaling/stretching,  compressed  image  with  the  wavelet  approach 
such  as  the  EZW,  and  halftoning.  It  is  found  that  the  DWT  based  watermark  approach 
we  proposed  in  this  paper  is  robust  to  all  the  above  distortions  while  the  DCT  approach 
is  not,  in  particular,  to  distortions,  such  as  compression,  rescaling/stretching  (1%,  2%, 
and  25%  were  tested),  and  additive  noise  with  large  noise  variance. 

6.  Acknowledgements 

Xia  was  supported  in  part  by  the  Air  Force  Office  of  Scientific  Research  (AFOSR)  under 
Grant  No.  F49620-97-1-0253  and  the  National  Science  Foundation  CAREER  Program 
under  Grant  MIP-9703377.  Boncelet  and  Arce  were  supported  in  part  through  collabo¬ 
rative  participation  in  the  Advanced  Telecommunications/Information  Distribution  Re¬ 
search  Program  (ATIRP)  Consortium  sponsored  by  the  U.S.  Army  Research  Laboratory 
under  the  Federated  Laboratory  Program,  Cooperative  Agreement  DAAL01-96-0002. 
Arce  was  also  supported  in  part  by  the  National  Science  Foundation  under  the  Grant 
MIP-9530923.  They  wish  to  thank  the  anonymous  referees  and  the  guest  editor,  Dr.  In- 
gomar  Cox,  for  their  many  helpful  comments  and  suggestions  that  improved  the  clarity 
of  this  manuscript.  They  would  also  like  to  thank  Mr.  Jose  Paredes  for  implementing 
numerous  image  compressions  using  the  EZW  method. 


#7038  -  $15.00  US 
(C)OSA  1998 


Received  October  14,  1998;  Revised  November  25,  1998 
7  December  1998  /  Vol.  3,  No.  12  /  OPTICS  EXPRESS  505 


Figure  6.  (a)  Original  “pepper”  image;  (b)  Watermarked  image  using  DWT, 
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HH  band,  low  noise  image  HH  band,  high  noise  image 


Figure  9.  Correlations  for  watermark  detection  for  the  “peppers”  image:  (a)  DWT 
with  HH\  band  for  low  additive  noise;  (b)  DWT  with  HH\  band  for  high  additive 
noise;  (d)  DWT  with  HH\  and  LH\  bands  for  high  additive  noise;  (c)  DCT  for 
high  additive  noise. 


Figure  10.  Correlations  for  watermark  detection  for  the  “car”  image:  (a)  DWT 
with  HH\  band  for  low  additive  noise;  (b)  DWT  with  HH\  band  for  high  additive 
noise;  (d)  DWT  with  HH\  and  LH\  bands  for  high  additive  noise;  (c)  DCT  for 
high  additive  noise. 
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Figure  11.  Correlations  for  watermark  detection  for  the  rescaled  “peppers”  image: 
(a)  and  (b)  piecewise  constant  interpolation  in  the  rescaling  and  (a)  DWT  (b)  DCT; 
(c)  and  (d)  cubic  spline  interpolation  in  the  rescaling  and  (c)  DWT  (d)  DCT. 
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Figure  12.  Correlations  for  watermark  detection  for  the  rescaled  “car”  image:  (a) 
and  (b)  piecewise  constant  interpolation  in  the  rescaling  and  (a)  DWT  (b)  DCT; 
(c)  and  (d)  cubic  spline  interpolation  in  the  rescaling  and  (c)  DWT  (d)  DCT. 
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Figure  13.  Correlations  for  watermark  detection  for  the  stretched  “peppers”  im¬ 
age:  (a)  and  (b)  piecewise  constant  interpolation  in  the  rescaling  and  (a)  DWT  (b) 
DCT;  (c)  and  (d)  cubic  spline  interpolation  in  the  rescaling  and  (c)  DWT  (d)  DCT. 
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Figure  14.  Correlations  for  watermark  detection  for  the  stretched  “car”  image: 
(a)  and  (b)  piecewise  constant  interpolation  in  the  rescaling  and  (a)  DWT  (b)  DCT; 
(c)  and  (d)  cubic  spline  interpolation  in  the  rescaling  and  (c)  DWT  (d)  DCT. 


#7038  -  515.00  US 
(C)  OS  A  1998 


Received  October  14,  1998;  Revised  November  25,  1998 

7  December  1998  /  Vol.  3,  No.  12  /  OPTICS  EXPRESS  509 


1 


DWT  correlation 


DCT  correlation 


Figure  15.  Correlations  for  watermark  detection  for  the  stretched  “peppers”  im¬ 
age:  (a)  and  (b)  1%  stretching  and  (a)  DWT  (b)  DCT;  (c)  and  (d)  2%  stretching 
and  (c)  DWT '(d)  DCT. 
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Figure  16.  Correlations  for  watermark  detection  for  the  stretched  “car”  image: 
(a)  and  (b)  1%  stretching  and  (a)  DWT  (b)  DCT;  (c)  and  (d)  2%  stretching  and 
(c)  DWT  (d)  DCT. 
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Figure  17.  Correlations  for  watermark  detection  for  compressed  images:  (a) 
DWT;  (b)  DQT. 
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Figure  18.  Correlations  for  watermark  detection  for  halftoned  images:  (a)  DWT; 
(b)  DCT. 
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PROOF  The  authors  [1]  assumed  that  both  the  target 
relative  position  and  relative  speed  are  uniformly 
distributed  between  0  to  60  km  and  - 40  to  40  m/s, 
respectively.  Let  us  derive  the  variance  of  uniform 
distributed  variable.  The  uniform  probability  density 
function  (pdf)  is  defined  by  [2] 

/(0=r-L  °<t<b 

b  —  a 

-  0  otherwise 


for  real  constants  -oc  <  a  <  oc  and  b  >  a.  Fig.  1 
illustrates  the  behavior  of  the  above  function. 
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In  case  of  relative  range,  x  component  b  -  60 sin  30, 
a  -  0  and  y  component  b  =  60cos/30,  a  =  0.  Hence 
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Doppler  Ambiguity  Resolution  Using  Optimal 
Multiple  Pulse  Repetition  Frequencies 


Ferrari,  Berenguer,  and  Alengrin  recently  proposed  an 
algorithm  for  velocity  ambiguity  resolution  in  coherent  pulsed 
Doppler  radar  using  multiple  pulse  repetition  frequencies  (PRFs). 
In  this  algorithm,  two  step  estimations  (folded  frequency  and 
ambiguity  order)  for  the  Doppler  frequency  is  used  by  choosing 
particular  PRF  values.  The  folded  frequency  is  the  fractional  part 
of  the  Doppler  frequency  and  is  estimated  by  averaging  the  folded 
frequency  estimates  for  each  PRF.  The  ambiguity  order  is  the 
integer  part  of  the  Doppler  frequency  and  is  estimated  by  using 
the  quasi -maximum-likelihood  criterion.  The  PRF  are  grouped 
into  pairs  and  each  pair  PRF  values  are  symmetric  about  1.  The 
folded  frequency  estimate  for  each  pair  is  the  circular  mean  of  the 
two  folded  frequency  estimates  of  the  pair  due  to  the  symmetry 
property. 

We  propose  a  new  algorithm  based  on  the  optimal  choice 
of  the  PRF  values,  where  the  PRF  values  are  also  grouped  into 
pairs.  In  each  pair  PRF  values,  one  is  given  and  the  other  is  ' 
optimally  chosen.  The  optimality  is  built  upon  the  minimal 
sidelobes  of  the  maximum  likelihood  criterion.  Numerical 
simulations  are  presented  to  illustrate  the  improved  performance. 


I.  INTRODUCTION 

Multiple  pulse  repetition  frequency  (PRF)  is 
commonly  used  in  modern-day  radars  for  the  velocity 
ambiguity  resolution  in  coherent  pulsed  Doppler 
radars,  see  for  example  [1-4].  In  this  approach,  the 
conventional  method  for  achieving  the  ambiguity 
resolution  is  to  search  for  the  coincidence  between 
unfolded  Doppler  frequency  estimates  for  each 
PRF,  see  for  example  [2-4].  Since  the  Doppler 
frequency  may  take  all  possible  real  values  in  a 
range  and  infinite  many  trials  are  needed  for  all 
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Fig  1.  Multiple  PRF  waveform. 


the  possibilities  of  the  Doppler  frequency,  it  maybe 
impossible  to  have  an  exact  match.  Thus,  estimation 
errors  usually  occur.  Based  on  this  observation,  a 
two  step  estimation  algorithm  has  been  proposed  in 
[1]  by  Ferrari,  Berenguer,  and  Alengrin.  The  basic 
idea  for  the  two  step  estimation  is  the  following. 

The  Doppler  frequency  is  decomposed  into  two 
pans:  the  folded  part,  i.e.,  the  fractional  part  modulo 
1,  and  the  ambiguity  order  part,  i.e.,  the  integer 
pan.  By  grouping  the  PRFs  into  pairs  where  each 
pair  is  symmetric  about  1,  the  folded  part  is  the 
“circular  mean”  [5]  of  the  folded  estimates  of  the  pair 
PRFs.  This  circular  averaging  is  the  First  step  of  the 
algorithm  in  [1].  After  the  folded  part  is  estimated, 
the  second  step  is  to  find  the  match  of  the  ambiguity 
order.  By  noticing  that  the  ambiguity  order  takes 
integer  values,  there  are  only  finite  many  possible 
trials  needed  ranging  from  the  minimal  and  the 
maximal  possible  ambiguity  orders.  Therefore,  the 
exact  estimation  of  the  Doppler  frequency  becomes 
possible  with  the  two  step  estimation.  Note  that  the 
key  of  this  method  is  to  convert  the  infinite  many  trials 
to  the  finite  many  trials,  by  converting  a  general  real 
number  matching  to  an  integer  matching. 

The  motivation  for  this  paper  is  as  follows.  Since 
the  specific  PRP  pairs,  which  are  symmetric  about 
1 ,  are  needed  in  the  Ferrari-Berenguer-Alengrin 
approach,  it  may  reduce  the  detectability  of  using  the 
maximum  likelihood  criterion  to  detect  the  peak  or 
the  match.  It  is  because  the  sidelobes  of  the  maximum 
likelihood  function  with  the  specific  PRFs  may  not 
be  as  low  as  the  one  with  other  PRFs.  The  motivation 
of  this  work  is  to  relax  the  above  PRF  condition  in 
the  following  way:  one  of  each  pair  PRFs  is  fixed  and 
the  other  of  the  pair  is  optimally  determined  based 
on  the  lowest  sidelobes  of  the  maximum  likelihood 
function.  With  this  relaxization,  the  “circular  mean” 
estimation  of  the  folded  frequency  may  not  be  as  good 
as  the  one  in  [1].  We  propose  an  alternative  approach 
to  achieve  the  folded  frequency  estimation  as  follows. 
We  first  take  the  conventional  mean  of  the  folded 


frequency  estimates  in  each  pair.  The  true  folded 
frequency  falls  in  a  finite  number  of  possibilities  from 
the  conventional  mean.  These  finite  possibilities  of 
the  folded  frequency  can  be  obtained  when  the  PRF 
pairs  are  known.  Since  the  ambiguity  order  has  also 
finite  possibilities,  the  overall  folded  frequency  and 
the  ambiguity  order  have  finite  possibilities.  This 
suggests  us  to  estimate  both  the  folded  frequency 
and  the  ambiguity  order  simultaneously  based  on 
the  maximum  likelihood  criterion.  What  is  gained 
here  is  the  detectability  improvement  of  the  Doppler 
frequency  while  the  penalty  is  the  increase  of  the 
computational  complexity  with  a  multiple  of  the 
one  in  f  1]  due  to  more  possibilities  to  search  for.  the 
folded  frequency.  . 

This  paper  is  organized  as  follows.  In  Section  II, 
we  briefly  review  the  Ferrari-Berenguer-Alengrin 
approach  proposed  in  [1].  In  Section  III,  we 
study  the  optimal  PRF  method.  In  Section  IV,  we 
present  numerical  examples  which  outperform  the 
Ferrari-Berenguer-Alengrin  method. 

II.  THE  FERRARI-BERENGUER-ALENGRIN  TWO 

STEP  ESTIMATION  METHOD 

First  of  all,  we  briefly  describe  the  problem.  Let 
radar  transmit  2 Nb  bursts  of  ns  pulses,  where  the  PRF 
of  the  kih  burst  is  assumed  Fr(k),  1  <  k  <  2 Nb.  The 
time  difference  between  two  pulses  in  the  kth  burst  is 
Tr(k)  -  1  /Fr(k).  It  is  assumed  that  the  elapsed  time 
between  the  last  pulse  of  the  Jtth  burst  and  the  first 
one  of  the  (it  +  l)th  burst  is  7>(it).  The  time  delays 
Tr(k)  are  assumed  as 

Tm‘('*m)Tr  (l) 

where  N(l),N(2),...,N(2Nb)  are  integers  and  Tr  is 
usually  assumed  as  l  for  simplicity.  The  multiple  PRF 
waveform  is  shown  in  Fig.  1. 

After  coherent  demodulation,  the  received  data 
at  the  nth  sample,  0  <  n  <  ns  -  1,  in  the  kth  burst. 
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Let 


1  <  k  <  2Nb<  becomes 

yk(n)  =  xk(n)  +  bk(n )  =  c ik(.fD)cxp(j2-fDnTr(k ))  +  bkUD 

(2) 

where  fD  is  the  unknown  Doppler  frequency,  bk ( n ) 
is  white  noise- from  the  contribution  of  both  thermal 
noise  and  clutter  whitened  residue,  and  ak(fD)  contains 
the  initial  phase  of  the  target  signal  on  the  kth  burst.  If 
«,( fD)  =  A,  then  we  have 


f  =  fr  +  1  <  k  <  2Nb.  (8) 

If  fk  could  be  obtained  from  vA.(rt),  0  <  n  <  n^-  1% 
in  (7),  by  using  N(2p  +  2)  =  -N(2p  +  1)  in  (5),  the 
reduced  frequency  fr  would  be 


__  flp*  I  f2p+2 


o  <  P  <  Nb  -  I . 


ak(fD)  =  ,4 exp  j2r:nsfDY^Tr(q)  ,  A*  >  2. 


Then  the  ambiguity  resolution  problem  is  to  estimate 
the  Doppler  frequency  fD  from  the  noisy  data  yk(n)  in 
(2).  It  is  usually  assumed  that  fD  is  in  a  certain  range, 
i.e.,  I/I  <  /max.  The  conventional  detection  method  is 
the  following  maximum  likelihood  estimation.  Find 
fD  that  maximizes  the  following  maximum  likelihood 
function 


From  yk(n)  in  (7)  what  we  can  get  for  fk  is,  however, 
its  folded  version  fk,  i.e., 

fk  =  fk  +  /,  /  is  an  unknown  integer  and  0  <fk<  1. 

(10) 

In  this  case,  the  reduced  frequency  fr  cannot  be 
obtained jrom  fk  by  simply  taking  their  mean  as 
Uip+x  +/2p+2)/2.  However,  when 

\f2p+\  ~flp+2\  <  0.5  (11) 


J  2Nb  n,  -  I  / 

Uf)  =  Z  Y,yk(mK(J)e*P(-j2*fnTr(k))\ 

'k  =  im  =  0  I 

(4) 

i.e., 

Li fD)  =  max  L(J) 

/ ,  —/mix 

where  ak{f)  takes  the  form  (3)  with  fD  replaced  by 
f  and  ;/0 1  <  f^x.  This  is  a  matching  process  and  / 
needs  to  run  all  real  numbers  from  -/max  to 
Clearly,  it  has  infinite  many  trials  and  therefore  is 
impossible  to  have  an  exact  match. 

In  [I],  Ferrari-Berenguer-Alengrin  proposed  an 
alternative  two  step  approach  for  the  above  problem 
without  implementing  infinite  many  trials,  where 
particular  N(k)  in  (1)  were  used.  We  next  want  to 
briefly  describe  this  two  step  approach. 

Let  iV( 2/7  +  1)  be  a  positive  integer  and  set 

N(2p  +  2)  =  -N(2p  +  1),  for  p  =  0,1 . Nb-  1. 

(5) 

The  Doppler  frequency  fD  is  decomposed  into  its 
integer  pan  (the  ambiguity  order)  nr  and  fractional 
part  (the  folded  or  reduced  frequency)  fr  as 

/o=/r+nr  wi*  0</r<l.  (6) 

Then  (2)  becomes 

y*(n)  =  ak{fD)txp  (j2tt  (fr  +  n)  +  bk(n), 

0<n  </»,-!.  (7) 


the  reduced  frequency  fr  can  be  recovered  from  fk  by 
taking  the  “circular  mean”  [5]  as 

l(P)  =  ^anSlefexP02-/2p.i)  +  expO'2jr/2/>+2)] 

(12) 

where  angle(c)  is  the  phase  angle  in  radians  in  [0j2;r) 
of  the  complex  number  z.  With  total  Nb  pairs  of  fk,  the 
overall  estimate  of  the  reduced  frequency  fr  is 


1 

fr  =  ^ angle  £  exp(J2nfr(p)).  ( 1 3) 

/>= o 

When  the  Doppler  frequency  /  in  (4)  is  split  into 
its  reduced  frequency  part  /  (without  confusion  in 
understanding  we  also  use  /  to  denote  the  reduced 
frequency)  and  its  ambiguity  order  part  n,  the 
maximum  likelihood  function  in  (4)  can  be  written 
as 


L(J,n)  = 


n,  - 1 


*=l  m= 0 


2 

•  exp(-/27r(/  +  n)mTr(Jc)) 


2M>  nt-\ 

XU  aW' n)  J2  >*(m)  exp(-y27r/mr  r(k)) 

k  -  1  m  =0 


eXP 


(14) 
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where  ak(f*n)  corresponds  to  the  term  ak(f )  in  (4) 
and  can  be  expressed  as 


i  =  exp( j2~fnt2p) 


(15) 


f m  =  exp  (,2 rf„,  (  2p  .  I  ♦  ) 

exp(j2:7"'i«2^rn)' 


(16) 


After  the  reduced  frequency  fr  is  estimated  as  in  (13), 
the  maximum  likelihood  function  Lif.n)  for  both  / 
and  n  is  reduced  to  the  one  for  the  ambiguity  order  n 
only: 

L(n)  =  L(fr,n) 


2.v#  __  n,-\ 

=  .^2 akOr’  n)  yk(m'>eKP(~j2nfrmT r(*» 

4=1  m=0 


where  ri  ranges  all  integers  from  -nmax  to  nmax  and 
nm*x  xs  r^e  maximum  ambiguity  order  corresponding 
to  the  maximum  Doppler  frequency  Thus,  the 
searching  of  the  Doppler  frequency  fD  from  all  the 
real  numbers  \f\  <  /max  to  maximize  L(f)  in  (4) 
becomes  the  searching  of  the  ambiguity  order  n 
from  all  integers  \n\  <  nmax  to  maximize  L(n)  in  (17). 
Note  that  there  are  only  finite  many  possibilities  of 
n,  which  makes  the  exact  coincidence  of  the  true 
ambiguity  order  possible.  Let  hr  denote  the  optimal 
ambiguity  order  estimate  from  L{n)  in  (17).  Then  the 
final  Doppler  frequency  estimate  is 

fD  */,+«,.  (18) 


Assume 

<  1.  K(k)\>\+nmjx.  (20, 

In  this  case,  although  the  circular  mean  (12)  may  not 
be  equal  to  the  reduced  frequency  fr  in  (8),  fr  takes 
one  of  the  following  five  values: 

Jr(p).  f  r(p)~  0.5,  Jr(p)  +  0.5 , 

fr(p)-l.  fr(p)+  1 

where  fr(p)  is  the  conventional  mean,  fr(p)  = 

(72p+1  +  /2p+ 2>/2.  311(1  fk  are  obtained  from  (7) 
and  (10).  It  is  because  the  unknown  parameter  /  in 
(10)  may  only  take  0,  -1  or  1,  when  the  condition 
(20)  holds  and  0  <fr<  1 .  Thus,  when  Nb  =  1 ,  the 
estimation  of  fr  and  n,  become  the  search  of  the 
optimal  7r(p)  3,1(1  nr  in  the  maximum  likelehood 
function  L(fr,n)  in  (14)  among 

fr  6  Sip)  =  {/ rip)  J r(P)  ~  0.5,  fr(p)  +  0.5, 

7r(p)-lJr(P)+l}  (22) 


and  [n|  <  nmax : 

^(fr(p)^r)=  max  L(f,n)  (23) 

feS(p)„n>  <71^ 

which  also  has  only  finite  comparisons. 

When  Nb  >  1,  there  are  at  least  two  methods  to 
take  the  advantage  of  this  multiplicity.  One  is  to 
take  the  circular  mean  of  all  the  above  estimated 
fr(p)  as  in  (13).  The  other  is  to  search  the  optimal 
/  among  all  possible  elements  in  the  sets  S(p)  for 
p  =  0,1 . Nb-  1: 

L(fr>*r)=  max  L(J,n)  (24) 

feStln  ;<>W 


The  reason  for  choosing  N(k)  as  integers  in  the 
whole  approach  is  to  use  the  discrete  Fourier 
transform  (DFT)  calculations  in  (17)  for  the  maximum 
likelihood  function  evaluations.  For  more  details  on 
the  implementation  issue,  see  (1). 

The  above  is  the  main  idea  for  the  Ferrari- 
Berenguer-Alengrin  two  step  estimation  method.  We 
call  it  FBA  method.  It  is  built  upon  the  assumption  (5) 
and  the  condition  (11).  Condition  (II)  guarantees  the 
accurate  reduced  frequency  estimation  and  leads  to  the 
following  condition  on  N(k): 

IAWI  >4(1  +nm„)f  l  <k<2Nb  (19) 

where  nmax  is  the  maximum  ambiguity  order.  Clearly, 
when  is  large,  \N(k)\  needs  to  be  large.  Large 
\N(k)\  may  increase  ambiguity  order  errors  as 
mentioned  in  [1].  One  way  to  relax  the  condition 
(1 1)  or  (19)  is  as  follows,  which  also  serves  as  a 
foundation  for  the  optimal  multiple  PRF  discussed 
latter. 


where 

S=  \J  Sip). 

P= 0 

Note  that  the  condition  (20)  can  be  further  relaxed  by 
allowing  more  possibilities  for  the  reduced  frequency 
fr  from  the  mean  fr.  Thus,  the  size  of  N(k)  can 
basically  be  arbitrary.  The  detection  method  in 
(20)-(24)  is  called  modified  FBA  method.  On  the  other 
hand,  the  condition  (5)  may  cause  high  sidelobes  of 
the  maximum  likelihood  function  L(Jnn)  in  (14)  and 
therefore  reduce  the  performance  when  additive  white 
noise  bk(n)  in  (2)  is  significant.  The  goal  of  the  rest  of 
this  paper  is  to  relax  the  condition  (5)  and  search  for 
the  optimal  linear  relationship  between  N(2p  +  1)  and 
N(2p  +  2)  instead  of  N(2p  +  2)  =  -N(2p  +  1). 

It  should  be  mentioned  that  another  difference 
between  the  FBA  method  and  the  above  modified 
FBA  method  is  the  following.  In  the  FBA  method, 
the  angular  mean  is  taken  over  the  Nb  bursts  as  shown 
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in  (13),  while,  in  the  modified  FBA  method,  the 
multiplicity  of  the  bursts  gives  more  possibilities  to 
search  for  the  correct  folded  frequency.  The  angular 
mean  may  reduce  the  error  variance  of  the  reduced 
frequency,  while  the  more  possibilities  of  the  search 
may  provide  more  accurate  estimate  of  the  reduced 
frequency.  However,  the  latter  one  clearly  causes  more 
computations. 


III.  OPTIMAL  MULTIPLE  PRF  AND  DOPPLER 
FREQUENCY  DETECTION 


0,1 . Nb-  1.  let 


S(p)  =  {fr(p)..fr(p)- 


1  +o 


Jr(P)± 


1+V 


^,fr(p)±\).  (29) 

*  +  ,‘p  ) 


When  ap  =  1,  the  set  Sip)  in  (29)  is  the  same  as  the 
set  Sip )  in  (22).  Similar  to  (21),  we  have 


fr£S(p).  p  =  0,1 . N„-\.  (30) 


In  this  section,  we  use  the  same  signal  model  as 
described  in  Section  II,  where  the  assumption  (5)  is 
relaxed  as 

N(2p  +  2)  =  -apN(2p  +  1), 


Let 


5  = 


AT*-! 

U  ■*(/>). 


P= 0 


(31) 


for  p  =  0, 1 . Nb-  1  (25) 


where  N(2p  +  1)  are  positive  integers  and  ap  are 
positive  real  parameters.  The  goal  of  the  rest  of  this 
paper  is  to  optimally  determine  the  parameters  ap 

given  N{2p  +  1)  for  p  =  0, 1 . Nb  -  1  in  terms  of  the 

lowest  sidelobes  of  the  maximum  likelihood  function 
L(f,n). 

With  (25),  an  analogous  formula  of  (9)  for  the 
reduced  frequency  is 


fr  = 


_  f2p+\  +  Qpflp  +  Z 


1  +  a„ 


p  =  0, 1 . Nb  -  1 


(26) 


where  fk  are  defined  in  (8).  One  can  see  that  the 
conventional  mean  (9)  with  the  property  (5)  becomes 
the  conventional  weighted  mean  (26)  with  the 
property  (25).  The  circular  mean  in  (12),  however, 
cannot  be  generalized  to  the  general  setting  of  the 
parameters  ap.  In  other  words,  the  reduced  frequency 
ft  can  not  be  obtained  as  in  the  FBA  method  from 

the  estimated  individual  fk  in  (8),  (10),  and  (25) 
with  general  parameters  ap  unless  ap  -  1  using  the 
periodogram  method.  Fortunately,  the  argument  in 
(20M24)  can  be  generalized  as  follows. 

Without  loss  of  generality,  we  assume  the  property 
(20),  i.e.. 


N(2p  +  1)  >  1  + 
|A'(2p  +  1)|  >  1  +  "max 

Let 


and 

P  =  0, 1 . N„-  1. 

(27) 


’  apf2p+2 


1  +  Q. 


p  =  0,1, ...,74-1 


(28) 

where  fk  are  obtained  from  (7),  (8),  and  (10) 
with  N(k)  satisfying  (25)  instead  of  (5).  For  p  = 


Then  the  maximum  likelihood  estimates  for  the 
reduced  frequency  fr  and  the  ambiguity  order  nr  are  fr 
and  hr  that  maximize  L(f,n )  for  /  €  S  and  |n|  <  nmax, 
i.e., 

i'(/r'”r)=  max  L(f,n)  (32) 

f€S..n  <«■« 

where  L(J,n)  is  similar  to  (14): 


|  ".-I 

£•(/»  =  !  Vai(/.tJ)5Zyi(rM)exp(-;2-/m7'r(k)) 
it  =  l  m= 0 


.exp( 


(33) 


where 


ak(f,n)  =  Aexp  ^j2~ns(J  +  n)^Tr(^)j 
Tr(q)  =  1  + 


(34) 


A Kg) 


and  yk(m)  are  the  demodulated  noisy  data  at  the 
receiver: 

yk(m)  =  ak(fr,nr)exp(j2-n  frmTr(k)) 

■  exp  +  (35> 


where  fD  =  fr  +  nr  is  the  unknown  Doppler  frequency 
and  bk(m )  are  additive^white  noise.  The  final  Doppler 
frequency  estimate  is  fD  =  fr  +  hr. 

The  performance  of  the  above  detection  method 
depends  on  the  property  of  the  maximum  likelihood 
function  L(Jtn).  The  lower  sidelobes  of  L(J,n) 
are,  the  better  performance  of  the  detection  is.  The 
sidelobes  depend  on  the  choice  of  the  parameters  ap 
in  (35),  when  N(2p  +  1)  are  given.  We  next  want  to 
discuss  the  optimal  choice  of  these  parameters. 
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By  substituting  (34)-(35)  into  (33),  we  have 

i  vv  ,  ft  -  I  \ 

Lif.n)  =  Mi1'  £  exp  (  j2 -/♦»,*  n,Z  ^  ) 


0  j . -  1,  and  nma,.  the  optimal  parameters  np. 

=  0  1  ..,/V  -  1.  can  be  obtained  by  minimizing 
the  cost  function  E,ldci.,be('*'>-'‘i . 0v,-P  in  (39)  tor 


np  >  0.  i.e., 


^^exp  lj2~t.tr  -  fvn  Trtk)  (exp  (./-•  V(^(  m) 


2Sh  /  * 

=  ;/t|2',^exp  |  j2-n,(/f  -m^Triq) 


£-ddobe^*0.«l . ".VI; 


min  £,ijeu.be('*0’oi . °.v;  - 1  * 

a0>Oo-Oi>ai . »VI>0S-' 


where,  by  (27), 


«p  ( j*(n,  - 1 )  (</,  -  /)W*)  ♦  )  )  0ne  may  see  that  an  explicit  solution  for  the  optimal 

V  n  _n  ,  q  is  not  possible.  However,  any  existing  optimization 

sin  f ~n  \(fr  -f)Tr(k) •*.  methods  work  for  the  above  problem. 

-Vr - rrrpr :  (36)  Let  us  consider  the  simpliest  case,  Nb  =  \An  this 

,»(*  case, 


i  + 

“p  N(2p  +  1) 


Clearly,  the  mainlobe  value  of  the  above  maximum 
likelihood  function  is  its  value  when  f  =  fr  and 

L<Jr,nt)  =  \A\22Nbns.  (37 

Since  fr  €  Sip).  the  offset  value  fr-fm  (36)  may 
only  take  the  values  in  the  following  set,  when  f  £  o 
defined  in  (30): 

•v»-i  ,  ±1  ±2  ±Op 

Offset  =  U  j  ±  l.±2,  i  +  j  +a~’  i  +  Q  ' 

/>= o  *■  p 

±2ap  ±(1-QP)  -jO  -Qp> 
T+~cTp'  l+Qp  '  1+Qp 
±(l±2a,)  ±(2±o,)\  (3I 

1  +  Qp  ’  l+ap  J 

The  offset  value  nr  -  n  is  in  the  set 

{rL . o*_,)  denote  the  total 

energy  of  all  the  sidelobe  values  of  the  maximum 
likelihood  function  Lif.n)  in  (36).  Then,  by 
normalizing  A  =  1  it  can  be  expressed  by 

£*,jelob<(Q0-Ql . 

f€.$oSu*  0<  a 

•' jX>P  [i2*nM  +  ('  +  atc?))] 

•exp  (j*ins-  1)(/(1  +  +  tf(ft))) 


±1  ±2  ±Qp 

±1,±2’  1  +  Gq  *  1  +Q0’  1  +  <V 

±2a0  ± (1  ~  Qp)  ±2(1  —  ctp) 

1  +q0’  l  +Qo  ’  1  +Qo 

±(1  ±  2q0)  ±(2±q0)\ 

1  +  q0  ’  1  +  Oq  / 


Esjde|obe(^0^  ^  ^  ^  ^ 


Nil))  Nil) 


.  sin  7rn 


sin  f  7r  j /  f  1  +  N(1)  J  +  W) 


+  exp  ^j2nns(J  +  n)  ^  1  ♦ 


exp(^(n,-l)[/^l-0oW)J  QqN{1) 

sin  [/  ('  ~  an,V(l))  ~  OqT/Q),  ) 


sin  (  it  /  (  1  - 


Nik))  Nik) 


sm  (*"*[/( - Wo)  (39)  jv  NUMER)CAL  EXPERIMENTS 

s^  y  \  +  Af(/c)J  Nik)  )  In  this  section,  we  present  numerical  examples 

„  Mrtnj-n  b-0  1  Nu  -  1,  to  compare  the  performances  for  the  modified  FBA 

a.1;  ;S'(38)pgi™„V(2P  ; -  *«  «*>*  op.-,™*  m* 
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Q0  Ni\)J  Oo  N(l) 


Let  us  see  some  numerical  examples  of 
Esideiobe(Qo)-  Consider  N(l)  =  40  and  ns  =  12.  Figs.  2, 

3,  and  4  show  the  £sideiobe(Qo)  versus  Qo  when  nm» 

3(  5>  and  12,  respectively.  One  can  see  that  the  optimal 
a0  strongly  depends  on  the  maximal  ambiguity  order 
n  ,  where  the  optimal  a0  are  q0  =  0.57,  1.85,  and 
2mo“  for  =  3,  5,  and  12,  respectively. 


Fig.  2  fsiddobc(Q0^  whcn  W)  =  40-  ns  -  12’  and  "max  =  2-  Optimal  a0  =  0.57. 


proposed  in  this  work.  The  following  parameters  are 
used  in  our  simulations:  N(  1)  =  40,  Nb  =  1 1  ns  -  12, 
and  jV( 2)  =  -o0N(l),  where  q0  =  1  for  FBA  method 
and  q0  the  optimal  q0  for  the  method  proposed  in  this 
work.  The  additive  noise  bk(n)  in  the  known  noisy 
radar  data  )>*(«)  in  (2)  is  assumed  white  Gaussian 
noise  with  mean  0  and  variance  a2.  As  mentioned  at 
the  end  of  Section  III,  the  optimal  aQ  depends  on  the 
maxima!  ambiguity  order  n^.  Two  different  n ^ 
are  tested:  nmix  =  3  and  12.  Let  M  be  the  number 
of  signal  realizations.  Let  fD(k)  be  the  true  Doppler 
frequency  and  fD(k)  be  the  estimated  one  at  the  kih 
signal  realization.  Then  the  mean  squared  error  (MSE) 
is  calculated  as 

MSE  =  (43) 


The  signal-to-noise  ratio  (SNR)  for  the  additive 
Gaussian  noise  is  calculated  by  SNR  =  A2 /a2,  where 
A  is  the  transmitted  signal  amplitude. 

When  nmax  =  3  and  N(l)  =  -JV( 2)  =  40  >  4(1  +  3) 
=  16,  i.e.,  the  condition  (19)  or  (1 1)  holds  for  the 
accurate  circular  mean  formula  (12).  The  FBA  method 
works  in  this  case  although  the  parameter  a  =  1 
is  not  optimal  in  terms  of  the  sidelobe  values  of 
the  maximum  likelihood  function  fside^fao).  The 
optimal  parameter  q0  in  this  case  is  q0  =  0.57  as 
studied  in  Section  III.  When  a0  =  0.57,  clearly  the 
number  N( 2)  =  -0^(1)  =  22.8  is  not  an  integer.  For 
the  DFT  computation  purpose,  rounding  a  =  0.57  to 
q0  =  0.6  may  be  needed  for  N  (2)  to  be  an  integer. 
When  q0  =  0.6,  W(2)  =  -24.  As  mentioned  in  Section 
III,  when  q0  f  1 ,  the  accurate  circular  mean  no  longer 
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44$, 


Fig  5  Comparison  of  reciprocal  MSE  of  Doppler  frequency  estimations:  FBA  method  and  modified  FBA  method  with  optimal  PRFs 
(or  a  )  Solid  line  modified  FBA  method  with  optimal  Qq  =  0.57;  dashdot  line:  modified  FBA  method  with  rounded  optimal  oQ  =  0.6; 
*°  dashed  line:  FBA  method.  Af(I)*  40,  n%  *"  ’  jf  “ 

holds.  In  this  case,  we  use  the  modified  FBA  method 
for  the  Doppler  frequency  detection.  20,000  Monte 
Carlo  tests  are  implemented,  i.e.,  M  =  20,000  in  (43). 

Three  curves  are  plotted  in  Fig.  5  for  the  reciprocal 
MSE,  1/MSE,  of  the  Doppler  frequency  estimations. 

The  solid  line  is  for  the  modified  FBA  method  with 
the  optimal  a  =  0.57;  the  dashdot  line  is  for  the 
modified  FBA  method  with  the  rounded  a0,  0.6;  the 
dashed  line  is  for  the  FBA  method.  A  significant 
improvement  of  the  MSE  at  the  transition  SNR  band 
can  be  clearly  seen. 

As  a  remark,  when  a0  =  1,  the  FBA  method  and 
the  modified  FBA  method  both  work.  From  our 


=  i maximal  amoiguuy  orocr 


numerous  numerical  examples,  these  two  methods 
have  the  same  performance  in  this  case. 

When  =  12  and  jV(1)  =  -W(2)  =  40  < 

4(1  +  12)  =  52,  i.e.,  the  condition  (19)  or  (1 1)  for  the 
accurate  circular  mean  formula  (12)  does  not  hold.  In 
this  case,  the  FBA  two  step  method  does  not  work 
as  shown  in  Fig.  6  and  the  modified  FBA  method 
should  be  used.  The  optimal  parameter  a0  is  a0  = 
2.01.  10,000  Monte  Carlo  tests  are  implemented,  i.e., 
M  =  10,000  in  (43).  Similar  to  Fig.  5,  three  curves  are 
plotted  in  Fig.  6  for  the  reciprocal  MSEs.  The  solid 
line  is  for  the  modified  FBA  method  with  the  optimal 
a0  =  2.01.  The  dashdot  line  is  for  the  modified  FBA 
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Fig  6.  Comparison  of  reciprocal  MSE  of  Doppler  frequency  estimations:  FBA  method,  modified  FBA  method,  and  modified  FBA 
method  with  optimal  PRFs  (or  o0).  Solid  line:  modified  FBA  method  with  optima]  a0  =  2.01;  dashdot  line:  modified  FBA  method; 
dashed  line:  FBA  method.  N{  1)  =  40,  ns  =  12,  maximal  ambiguity  order  nmlx  =  12. 


method  with  q0  =  1.  The  dashed  line  is  for  the  FBA 
method.  From  Fig.  6,  one  can  clearly  see  that  in  this 
case  the  FBA  method  fails,  and  the  modified  FBA 
method  with  the  optimal  q0  outperforms  the  one  with 
nonoptimal  q0. 

V.  CONCLUSION 

In  this  paper,  we  studied  the  Ferrari-Berenguer- 
Alengnn’s  two  step  Doppler  frequency  detection 
method,  where  the  folded  frequency  is  first  estimated 
using  the  circular  mean  and  the  ambiguity  order  is 
then  estimated  using  the  quasi  maximum  likelihood 
criterion.  The  accuracy  of  the  folded  frequency 
depends  on  the  use  of  the  particular  pairs  of  PRFs. 
When  the  folded  frequency  is  not  equal  to  the  circular 
mean,  we  modified  the  FBA  method  with  a  finite 
possibilities  of  the  folded  frequency  and  the  ambiguity 
order.  More  importantly,  we  studied  and  formulated 
the  optimal  PRFs  in  the  modified  FBA  method  in 
terms  of  minimizing  the  total  sidelobe  energy  of  the 
maximum  likelihood  function.  Better  performance  of 
the  modified  FBA  method  over  the  FBA  method  was 
shown  by  numerical  examples. 
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Abstract 

An  N  x  K  (N  ^  K)  ambiguity  resistant  (AR)  matrix  G(z)  is  an  irreducible  polyno¬ 
mial  matrix  of  size  N  x  K  over  a  field  F  such  that  the  equation  EG(z)  —  G(z)V(z)  with  E 
an  unknown  constant  matrix  and  V(z)  an  unknown  polynomial  matrix  has  only  the 
trivial  solution. E  =  oJNy  V(z)  —  odK,  where  a  €  F.  AR  matrices  have  been  introduced 
and  applied  in  modem  digital  communications  as  error  control  codes  defined  over  the 
complex  field.  In  this  paper  we  systematically  study  AR  matrices  over  an  infinite  field  F. 
We  discuss  the  classification  of  AR  matrices,  define  their  normal  forms,  find  their 
simplest  canonical  forms,  and  characterize  all  (K  +  1 )  x  K  AR  matrices  that  are  the 
most  interesting  matrices  in  the  applications.  ©  1999  Elsevier  Science  Inc.  All  rights 
reserved. 

AMS  classification:  15A21;  15A24;  94A12;  94B10;  94B12 

Keywords:  Irreducible  matrix;  Ambiguity  resistant  matrix;  Polynomial  matrix;  Error  control  coding 


1.  Background  and  introduction 

An  error  control  code  defined  over  the  complex  field  C  maps  each  K  input 
samples  into  N  output  samples,  where  N  is  usually  greater  than  K  so  that  the 
code  is  used  to  resist  errors  in  a  channel  and  the  code  is  called  N  x  K  code.  An 
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N  x  K  linear  error  control  code  is  usually  represented  by  an  N  x  K  polynomial 
matrix  G{z),  where  each  entry  of  the  matrix  G(z )  is  a  polynomial  of  the  variable 
z  (or  the  delay  variable  z~l  in  engineering)  over  complex  field  C.  Let  X(z)  be  a 
K  x  1  polynomial  matrix  (or  vector)  as  an  input  signal.  Then  Y(z)  =  G(z)X(z), 
is  the  N  x  l  polynomial  matrix  (or  vector)  of  the  code  output  signal,  which  is 
usually  transmitted  through  a  real  world  channel,  wired  or  wireless.  In  a 
channel  there  are  two  common  distortions.  One  is  an  additive  random  noise 
and  the  other  is  the  so-called  intersymbol  interference  (ISI).  An  additive  ran¬ 
dom  noise  means  that  the  received  signal  is  Y(z)  =  G(z)X{z)  +  t]{z)  instead  of 
Y(z),  where  tj{z)  is  the  polynomial  vector  of  the  additive  noise.  Notice  that  the 
above  additive  noise  only  affects  each  individual  sample  of  the  received  signal. 
The  ISI  is  another  type  of  distortion  in  a  channel,  which  is  usually  due  to  a  high 
speed  transmission  and  cause  distortions  between  received  samples.  Mathe¬ 
matically,  the  ISI  is  an  M  x  N  polynomial  matirx  H(z)  and  the  received  signal  is 

Y(z)  =  H(z)Y(z)  =  H{z)G(z)X(z),  (1-1) 

where  G(z)  is  an  error  control  code. 

Resistance  to  an  additive  random  noise  means  that  the  input  signal  X (z)  can 
be  restored  from  the  received  Y (z)  distorted  by  an  additive  noise  t](z)  without 
knowing  tj(z).  To  achieve  this  goal,  the  distance  between  codewords 
Y(z)  =  G(z)X(z)  after  an  error  control  code  plays  the  most  important  role,  see 
for  example  Ref.  [1],  which  is  beyond  the  scope  of  this  paper.  Similarly,  re¬ 
sistance  to  the  ISI  means  that  the  input  signal  X{z)  can  be  recovered  from  the 
received  Y{z)  in  Eq.  (1.1)  distorted  by  an  ISI  H(z)  without  knowing  H(z).  To 
achieve  this  goal,  ambiguity  resistant  (AR)  matrices  have  been  introduced  in 
Refs.  [6,7],  which  are  based  on  the  irreducibility  of  polynomial  matrices  defined 
over  the  complex  field  C.  In  this  paper,  a  general  infinite  field  F  is  considered. 
In  what  follows,  F  denotes  an  infinite  field  unless  otherwise  specified. 

The  definition  of  irreducible  polynomial  matrices  over  F  induced  from  Ref. 
[3]  is  as  follows. 

Definition  1.  An  N  x  K  (N  >  K)  polynomial  matrix  G(z)  over  F  is  irreducible  if 
and  only  if  there  is  no  K  x  K  polynomial  matrix  R(z)  over  F  with  non-constant 
determinant  such  that  G{z)  =  Q(z)R{z),  where  Q(z)  is  an  N  x  K  polynomial 
matrix  over  F. 

The  irreducibility  can  be  characterized  by  the  following  lemma  which  offers  am 
easy  method  to  judge  the  irreducibility  of  a  matrix  when  F  is  algebraically  closed. 

Lemma  1.  An  N  x  K  (N  >  K)  polynomial  matrix  G(z)  over  an  algebraically 
closed  field  F  is  irreducible  if  and  only  if  it  is  full  column  rank  (i.e., 
rank(G(z))  =  K)  for  any  z  €  F. 
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Remark  1.  In  Ref.  [3]  the  definition  of  the  irreducibility  of  a  polynomial  matrix 
over  C  is  given  by  the  necessary  and  sufficient  condition  in  Lemma  1.  However, 
when  F  is  not  algebraically  closed,  the  irreducibility  in  Definition  1  is  not 
_  equivalent  to  the  necessary  and  sufficient  condition  in  Lemma  1.  As  an 
example,  polynomial  matrix  (z2  +  l^z2  -f  2)1  n  has  full  rank  for  any  real  z  e  R. 
It  is,  however,  obviously  reducible. 

Definition  2.  An  N  x  K  (N  ^  K)  irreducible  polynomial  matrix  G(z )  over  F  is 
called  AR  if  and  only  if  the  following  equation 

EG(z)  =  G(z)  V  (z)  (1.2) 

with  E  an  unknown  constant  matrix  and  V{z)  an  unknown  polynomial  matrix 
over  F  has  only  the  trivial  solution  E  =  adN,  V{z)  —  cdKi  where  a  €  F,  IN  and  IK 
are  N  x  N  and  K  x  K  identity  matrices,  respectively. 

It  has  been  proved  in  Refs.  [6,7]  that,  if  the  code  G(z)  in  Eq.  (1.1)  over  the 
complex  field  C  is  AR,  then  the  input  signal  X(z)  can  be  blindly  recovered  from 
the  received  signal  Y(z)  in  Eq.  (LI),  where  the  knowledge  of  the  ISI  channel 
H(z)  is  not  necessary.  Therefore,  the  resistance  of  the  ISI  can  be  achieved  by 
choosing  a  code  G(z)  to  be  AR. 

Some  necessary  conditions  for  a  code  G(z)  over  C  to  be  AR  are  given  in  Ref. 
[6],  for  example,  G(z)  is  not  a  constant  matrix,  and  N  >  K.  Furthermore,  it  has 
been  proved  in  Refs.  [6,7]  that  the  following  N  x  (N  —  l)  codes  G(z)  are  AR 
over  C 


1 

0 

0 

...  0 

0  1 

z" 

l 

0 

...  0 

0 

0 

z' 

1 

...  0 

0 

0 

0 

0 

...  /■ 

1 

.  0 

0 

0 

...  o 

zr  . 

(1.3) 


for  any  positive  integer  r.  In  Ref.  [8],  the  following  N  x  (N  -  1)  polynomial 
matrices 


G(z)  = 


■  1 

0 

0 

0  ■ 

0 

1 

0 

0 

0 

0 

1 

0 

0 

0 

0 

1 

-fi(z) 

Fi{z) 

W 

...  F*_,(z)_ 

(1.4) 
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have  been  studied.  The  following  necessary  and  sufficient  condition  for  G{z)  in 
Eq.  (1.4)  to  be  AR  is  given  in  Ref.  [8]: 

{l,F1(z),F2(z),...,^_l(z)} 

is  linearly  independent  over  C,  which  can  be  also  seen  in  Theorem  4  in  this 
paper  for  a  general  infinite  field  F.  The  codes  G(z)  in  Eq.  (1.4)  are  called  sys¬ 
tematic  codes,  which  is  analogous  to  the  conventional  error  control  codes  de¬ 
fined  over  a  finite  field  [2]  for  the  encoding  convenience. 

In  this  paper,  we  systematically  study  AR  matrices  over  F.  We  provide 
canonical  forms  for  all  N  x  K  AR  matrices  and  characterize  all  N  x  (N  -  1) 
AR  matrices.  The  characterization  is  easy  to  use.  Since,  in  coding  applications, 
K  samples  are  expanded  to  N  samples,  when  an  N  x  K  code  is  used.  This  ex¬ 
pansion  means  that  the  bandwidth  needs  to  be  expanded  in  a  transmission, 
which  is  usually  expensive.  Therefore,  the  smallest  sample  expansion  in  coding 
is  usually  desired.  Clearly,  the  codes  of  size  N  x  (N  -  1)  provide  the  smallest 
bandwidth  expansions,  and  therefore  are  the  most  interesting  codes  in  appli¬ 
cations.  The  characterization  of  all  N  x  (N  -  1)  AR  matrices  provides  the 
opportunity  to  search  the  optimal  one  in  resisting  other  distortions,  such  as  the 
additive  random  noise  as  mentioned  before.  Some  results  have  been  obtained 
in  Refs.  [8,9]  along  this  direction. 

This  paper  is  organized  as  follows.  In  Section  2  we  present  the  canonical 
forms  for  N  x  K  AR  matrices.  In  Section  3,  we  provide  the  necessary  and 
sufficient  conditions  for  a  polynomial  matrix  of  size  N  x  (N  -  1 )  to  be  AR  in 
terms  of  its  systematic  or  canonical  form. 

2.  Classification  of  AR  matrices  and  canonical  form 

Let  F[z]  denote  the  polynomial  ring  over  an  infinite  field  F.  Let  Matxa:(F[z]) 
denote  the  set  of  all  AT  x  K  matrices  with  elements  in  F[z]. 

Definition  3.  The  transformation  TPQ  of  MWx^(F[z])  defined  by 

Tf,q{A)  =  PAQ  for  all  A  €  M*xjr(F[z]), 

where  P  and  Q  are  N  x  N  and  K  x  K  unimodular  polynomial  matrices  (i.e., 
their  determinants  are  non-zero  constants),  is  called  an  equivalence  transfor¬ 
mation  of  M\x*(F[z]). 

It  is  well  known  that  the  set  of  all  equivalence  transformations  of. 
MjVx*(F[z])  is  a  group  of  transformations  with  T,uJk  the  identity  transforma¬ 
tion,  and  with  the  formulas  TPqTrs  =  TPRSq  and  7),^  =  TP-\.q- \.  This  group 
induces  an  equivalence  relation  on  MArx*(F[z])  and  two  matrices  A  and  B  are 
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said  to  be  equivalent  over  F  if  there  exists  an  equivalence  transformation  Tp@ 
such  that  TPtQ{A)  =  B.  Since  polynomial  ring  F[z]  is  a  principal  ideal  ring,  we 
know  that  every  matrix  A  €  M^xA:(F[z])is  equivalent  to  a  diagonal  matrix  Z)(z), 
which  is  known  as  the  Smith  form  decomposition  [4,5].  This  general  theory 
applies  to  AR  matrices,  but  the  equivalence  relation  defined  above  does  not 
preserve  the  AR  property.  To  do  so,  we  define  AR-equivalence  transforma¬ 
tions  as  follows. 

Definition  4.  An  equivalence  transformation  Tp,q  is  called  an  AR-equivalence 
transformation  if  and  only  if  P  is  a  non-singular  constant  matrix  and  Q  is  a 
unimodular  polynomial  matrix. 

From  now  on,  in  order  to  avoid  confusion,  A  will  represent  a  constant 
matrix  and  A(z)  will  represent  a  polynomial  matrix  unless  otherwise  specified. 
We  have  the  following  result. 

Theorem  1.  An  AR-equivalence  transformation  preserves  the  AR  property , 
an  N  x  K  polynomial  matrix  G(z)  is  ambiguity  resistant  if  and  only  if  PG{z)Q{z) 
is  ambiguity  resistant  for  any  N  x  N  invertible  constant  matrix  P  and  any 
unimodular  polynomial  matrix  Q(z). 

Proof.  Let  E\ PG(z)Q{z)  =  PG{z)Q(z)Vfz).  Then  P~xExPG{z)  =  G{z)Q(z)Vfz) 
£H(z).  Hence  the  ambiguity  resistance  of  G(z)  implies  that  Q(z)V\(z)Q~l (z) 
=  odK  and  P~XE\P  =  which  implies  E\  =  a/#,  V\  (z)  =  cdK  for  some  non¬ 
zero  constant  a  €  F.  On  the  other  hand,  if  PG(z)Q(z)  is  AR,  EG(z)  =  G(z)V(z) 
means  PEP~l[PG(z)Q{z)}  =  [PG{z)Q{z))Qr\z)V (z)Q(z)  which  means  PEP’1 
—  a IN  and  ^’(zlFfzlOfz)  =  odK  for  some  non-zero  a  €  F.  Hence  E  =  a/^, 
V(z)  =  a IK  and  G(z)  is  AR.  □ 

We  can  easily  check  that  AR-equivalence  transformations  form  a  subgroup 
of  equivalence  transformations.  They  also  induce  an  equivalence  relation 
among  AR  matrices.  We  call  G|(z)  and  G^z)  AR-equivalent  if  there  is  an  AR- 
equivalence  transformation  7>0  such  that  TPgG\(z)  =  G2(z). 

For  an  irreducible  N  x  K  polynomial  matrix  G(z),  we  can  check  that  G(z)  is 
equivalent  to  matrix  [4, 0]T  (if  N  >  K)  where  Ar  means  the  transpose  of  matrix 
A  and  0  is  the  K  x  {N  -  K)  matrix  with  0  entries.  We  now  want  to  seek  a  simple 
form  of  matrix  G(z)  under  AR-equivalence,  which  is  useful  for  AR  charac¬ 
terization.  The  following  proposition  was  given  in  Ref.  [6],  where  result  (a)  is 
useful  later  as  a  necessary  condition  on  AR  matrices,  and  result  (b)  makes  us 
only  need  to  consider  the  case  N  >  K. 

Proposition  1.  If  an  N  x  K  (N  ^  K)  polynomial  matrix  G(z)  over  F  is  AR ,  then 

(a)  G{z)  is  not  AR-equivalent  to  a  matrix  whose  first  column  is  (1,0, ...  ,0)  ; 

(b)  N  >  K. 
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Proof,  (a)  Suppose  G(z )  is  AR-equivalent  to  a  matrix  with  first  column 
(1,0,...,  0)T,  then  by  simple  equivalence  it  follows  that  G(z)  is  AR-equivalent 
to  a  matrix  of  the  form 


G,(z)  = 


1  0 

.0  H{z)_ 

where  H(z)  is  an  (N  -  1)  x  (K  -  1)  polynomial  matrix.  Setting 


E  = 


2  0 

0  In-\  J  ’ 


and  F(z)  — 


0 

Ik-x 


we  see  that  EG\{z)  —  Gl(z)V(z)  and  V(z)  ^  cdK  for  any  non-zero  constant 
a  €  F.  In  other  words,  G(z )  is  not  AR. 

(b)  If  N  =  K,  then  the  irreducibility  of  G(z)  means  G(z)  is  unimodular.  So 
for  any  E,  V(z)  =  G~'(z)EG(z)  satisfies  EG(z)  =  G(z)V(z).  So  G(z)  is  not  AR. 

□ 


Lemma  2.  Any  polynomial  matrix  A(z)  €  Matxjt(F[z])  with  rank  =  K  is  AR- 
equivalent  to 


’£n(z) 

0 

0 

0  ' 

g2  l(z) 

gn(z) 

0 

0 

£32  (*) 

£33(2)  •  • • 

0 

gK\  (Z) 

gKl{z) 

£o(z)  •  •  • 

gKK(z) 

,gm{z) 

gN  2  00 

gN  3(2)  •  •  • 

gNK(z ). 

deg(gu 

(z))  <  degfeOO)  <  •  • 

■  ^deg(| 

(2.1) 


deg {gu{z))for  any  j  <  i. 


Proof.  Let  A(z)  be  an  N  x  K  matrix  with  entries  a,y(z).  Let 
d,{z)  =  GCD(a,i(z), . . . . aiK (z) ) .  By  row  permutation  only  we  may  assume  that 
dj(z)  ±  0  and  deg  dt  is  non-decreasing  with  /  for  i  =  1, . . . ,  K.  Now  A(z)  is  AR- 
equivalent  to  (by  only  column  transforms) 

'  d,(z)  0  ...  0  ' 

*21  (z)  *22  (z)  •••  *2x(z) 

.*vi(z)  bN2(z)  ...  bNK{z). 

Furthermore,  deg[GCD(A,2(z), . . . ,  biK{z))\  ^  deg[GCD(*,i  (z), . . . , blK{z))\ 
^  deg  di  >  deg  d\  (z)  for  i  =  2, . . . ,  N.  Similarly  we  can  deal  with  the  submatrix 
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bn(z) 


B(z)  = 


bN  i{z) 


bixiz) 


bm{z) 


with  rank(S)  =  K  -  1.  By  induction  the  lemma  is  proved.  □ 

Remark  2.  Form  (2.1)  has  a  direct  relationship  with  row-Hermite  forms  and  the 
above  lemma  can  also  be  proved  by  using  Theorem  6.3-2  in  Ref.  [3],  From  Ref. 
[3]  the  row-Hermite  form  of  a  matrix  A(z)  is  equal  to  A(z)Q(z)  where  Q{z)  is  a 
unimodular  K  x  K  matrix.  By  row  permutations,  it  is  guaranteed  that  the 
diagonal  elements  gjj(z)  are  non-zero  and  deg  gu  <  deg  gj,  if  1 
Using  column  operations  again  the  polynomial  matrix  can  be  reduced  to  the 
form  in  Lemma  2. 

Lemma  3.  For  L  polynomials  f\ (z)  0,/2(z), . . .  Ji{z)  over  F,  if  deg(GCD 

(c/i+/2,/3,-..,/i))>deg/i  for  any  constant  c  €  F,  then  /1I/2, /1I/3, 

•  •  •  ,/il/i- 

Proof.  We  first  prove  the  case  L  =  3.  It  is  obvious  if  f\  is  a  constant.  Now 
suppose  deg/i  >  1  and  dc(z)  =  GCD(c/i -f/2,/3).  Then  deg  dc  ^  1 .  Let 
d  =  GCD(/i,/2,/j).  Then  /1  =  dg\J2  =  dg2,ft  =  dgi  and  GCD(gi,g2,g3) 
=  1,  deg(GCD(cgi  +  g2,g3))  >degg,.  But  based  on  the  fact  that  if 
GCD(gi,g2,g3)=  1  over  an  infinite  field  F  then  there  exists  c  6  F  such  that 
GCD(cgi  +  g2 1  g3 )  =  L  Hence  the  above  two  cases  mean  GCD(cgi  +  g2,g3) 
=  1.  Therefore  we  have  GCD(c/i  +  f2,fi)  =  d  x  GCD(cgi  +  gi- gi)  =  d.  Now 
deg  d  ^  deg/i  and  df,  imply  d{z)  =  cf\(z)  for  some  non-zero  constant  c. 
Hence  f\\fi,  f\\fi-  For  general  L  we  know  deg(GCD(c/i  +  f2,fy, ... ,//.)) 
=  deg(GCD(c/i +/2,GCD(/3,...,/t))).  By  the  above  proof, /i|/2)/i|GCD 

/t).  Hence /1 1/2, /1 1/3,..., /lUt-  □ 

Lemma  4.  lfG(z)  =  ( g<,(z ))  is  a  non-zero  matrix  in  MWx*(F[z])  of  the  form  (2.1) 
and  if  g\\{z)  f  0  is  an  element  of  G(z)  with  m  =  deg(gi  1 )  < deg(g//)  for  any 
g,j(z),  then  either  gu(z)  divides  all  g,,(z),  or  else  there  exists  an  AR-equivalence 
transform  T  such  that  T(G(z))  =  H(z)  has  the  form  (2.1)  and  h\\{z)  f  0  is  of 
degree  less  than  m. 

Proof.  Suppose  gn{z)  does  not  divide  every  element  of  G{z).  By  Lemma  3, 
there  exists  a  constant  c€F  and  i,2^i^N  such  that  deg(GCD(cgn 
+giiigi2i  -  *  •  ,gu,  •  ..£,*)  <  deg  gu  -  m.  This  means  that  G(z)  is  AR-equivalent 
to  a  matrix  with  /-row  ( cgw  +  g,\>g,2 —  ,gih  •  •  •  >£/*)•  Now  Lemma  2  guaran¬ 
tees  that  G(r)  is  AR-equivalent  to  H(z)  of  form  (2.1)  with  deg  h\\  ^deg(GCD 
(cgu  +gt\,gi2,--’,gu . git;)  <  m-  □ 
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Combining  Lemmas  2  and  4  we  can  obtain  the  following  result. 

Theorem  2.  Any  non-zero  matrix  A(z)  £  MatxA:(F[z])  with  rank  =  k  is  AR- 
equivalent  to  a  matrix  of  the  following  form 


0 

0 

0 

0  ... 

0  • 

g2l(z) 

222  (z) 

0 

0 

0  ... 

0 

2*1  (z) 

2*2  (z) 

2*3(2)  ... 

2**(z) 

0  ... 

0 

-gm{z) 

gNl(z) 

2«(z)  •  •  • 

gNk(z) 

0  ... 

0  . 

with  ga\g{i ■+!)(/+!), gu\gji  for  any  /  =  1 , 2,  1  and  j  Ss  /. 


Proof.  Obviously,  A(z)  is  AR-equivalent  to  a  matrix  of  form  [B  0]  where  B  is  an 
N  x  k  matrix  with  rank(fi)  =  k.  By  Lemma  2,  we  have  that  any  non-zero  matrix 
is  AR-equivalent  to  a  matrix  as  above  such  that  g\\(z)  has  the  minimum 
degree.  If  g,,(z)  divides  all  g*/(z)  for  any  k,l  ^  i,  Theorem  2  is  proved.  If  g(i(r) 


not  divide 

some  gki(z)  for  some  k,  l  ^  /, 

we  then 

2/i  (z) 

0 

0 

0  • 

2(«+iy(z) 

2(/+i)(/+i)(z) 

0 

0 

2*1  (z) 

2*(/+i)(z) 

2*(/+2)  (z)  ... 

2**(z) 

.  gm  (z) 

2iV(/+i)(z) 

2/V(/+2)(z)  ... 

2m  (z). 

Therefore,  by  Lemma  4,  under  AR-equivalence  we  have  that  g„(z)  divides  all 
gki(z)  for  any  k,  l  >  i.  □ 


By  the  above  theorem,  for  irreducible  matrices,  we  have  the  following  result. 

Theorem  3.  Any  irreducible  matrix  in  M^xa:(F[z])  is  AR-equivalent  to  a  matrix  of 
the  following  form 


'  1 

0 

0 

0 

0  ‘ 

0 

1 

0 

0 

0 

0 

0 

0 

1 

0 

2*i  (z) 

gKl{z) 

2*3  (z) 

•••  2*(*-l)(z)  gKx{z) 

,2vi(z) 

2«(z) 

gNi{z) 

•••  2v(*-l)(z)  gMf(z)_ 
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with  GCD(g/cK,g(K+\)K,  •  •  •  >  gm)  =  1 ,  deg  giK  <  deg  gJK  for  K^i  <  j^N. 
Furthermore,  gu(z')  can  be  either  0  or  a  non-constant  polynomial  (ie., 
deg  gki  >  l)for  K^k^N  and  l  ^  l -  l,  and  gNl(z)  =  ■  ■  ■  =  gN(L-i)(z)  =  0, 
1  ^  deg  <  deg  gN{L+i)  <  •••  <  deg  gm  for  some  L  where  1  ^L<K. 

Proof.  By  Theorem  2,  if  g,-,(z)  is  not  a  non-zero  constant  for  1  <  i  —  1,  then 
gji(z)  =  gu(z)hji(z)  for  i  <  j^N.  For  example,  assume  /'=  1.  Then 


■  1 

0 

0 

0  - 

h21  (z) 

£22  (?) 

0 

0 

h/ci  (z) 

gKt{z) 

gKi(z)  ... 

gfcxiz) 

.hN\{z) 

gNl{z) 

gNi(z)  ■  ■  ■ 

gm{z) . 

which  contradicts  with  the  irreducibility  of  G(z)  because  the  leftmost  matrix  is 
not  unimodular.  Similar  arguments  can  be  used  to  prove  that 
. . .  ,gNK)  =  l.  When  gki(z)  is  a  non-zero  constant  for  some 
k,l  with  K  and  1  <  /  < K  —  1 ,  it  can  be  reduced  to  zero  by  implementing 

a  constant  elementary  row  operation,  i.e.,  gu{z)  can  be  reduced  to  zero  by  an 
AR-equivalence  transformation.  □ 

Remark  3.  The  result  in  Theorem  3  is  the  simplest  form  we  can  have,  which 
cannot  be  improved  further.  For  example,  we  can  directly  check  that 

'1  0  ‘ 

z  z2 

y  **+1. 

is  an  irreducible  matrix,  we  cannot  simplify  it  further  under  AR-equivalence 
transformations.  This  polynomial  matrix  is  actually  an  AR  matrix  from  The¬ 
orem  5  of  the  next  section. 

Definition  5.  If  A(z)  is  AR-equivalent  to  G{z)  of  the  form  (2.2),  then  Gfz)  is 
called  the  canonical  form  of  A(z).  If  gKK  =  1,  as  indicated  in  Section  1,  we  call 
G(z)  the  systematic  form  of  A(z). 

By  the  above  results,  we  can  easily  classify  irreducible  matrices  as  well  as  AR 
matrices.  So,  to  study  the  AR  property  of  a  polynomial  matrix,  we  only  need  to 
study  its  canonical  form  or  systematic  form. 
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3.  (K  + 1)  x  K  AR-matrices 

In  the  above  sections  we  have  discussed  the  classification  of  AR  matrices 
and  it  was  shown  that  every  N  xK  AR  matrix  is  of  form  (2.2).  In  this  section, 
we  present  the  sufficient  and  necessary  conditions  for  a  (K  4-  1)  x  K  matrix  to 
be  AR.  These  conditions  can  be  used  in  the  design  of  error  control  codes  in 
applications. 

We  first  see  the  simplest  form,  i.e.,  the  systematic  form  as  follows  (also  see 
Ref.  [8]). 


Theorem  4.  If  G{z)  has  systematic  form,  i.  e. , 


G(z)  = 


1 

0 

0 

0 

0 

1 

0 

0 

0 

0 

1 

0 

0 

0 

0 

1 

.g(K+l)l(z) 

g(K+  \)l{z) 

g(K+l)3(z)  •  •  • 

g(x+i)x(z). 

(K+l)xK 


then  G(z )  is  AR  if  and  only  if  { 1 ,  g(x+i)i  (z),  g(x-H)2(z),  g(x+i)3(z),  •  •  •  ,g(x+i)*(z)} 
are  linearly  independent  over  F. 

Proof.  Let  N  =  K  +  1 .  We  first  prove  the  necessity.  If 
{I,gvi(z),gv2(z),gv3(z),...,gvx(z)}  are  linearly  dependent,  then  there  exists 
k  €  {1,..., AT}  such  that 

K 

gmf)  =  c  +  ^  CigNiiz)- 


Hence,  there  exists  an  AR-equivalence  transform  that  transforms  G(z)  into  a 
matrix  with  its  first  column  as  (1,0, . . .  ,0)T.  Proposition  1  means  G{z)  is  not 
AR.  Thus 


{gM(z),gv2(z),g,V3(z),  •  •  •  ,£v*(z)} 

are  linearly  independent. 

We  now  prove  the  sufficiency.  Under  AR-equivalence,  we  may  assume 
1  ^  deg  gN\  <  deg  £«<•••<  deg  gNK.  By 
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e\\ 

...  eXN  ' 

'  1 

0 

0 

1 

o  - 

0 

eN\ 

•  •  ■  £nn  . 

0 

■  gn  l(z) 

0 

gNl(z)  ■  ■ 

1 

•  gm{z). 

’  1 

0 

0 

1 

0  - 

0 

~vn(z)  .. 

•  ViK (z)  ‘ 

0  0  ... 

-gN\(z)  gN2(z)  ... 

1 

gm(z). 

vKX(z)  .. 

•  vkk(z). 

we  obtain 

eu  +  eiNgNj{z)  =  v,j(z)  for  ij  —  1,2,...,  AT,  (3.1) 

K 

eNj  +  eNNgNj(z)  =  'Yjkj(z)gNk{z)  for  j  =  1,2, . .  .,K.  (3.2) 

*=1 

First,  from  Eq.  (3.1)  and  Eq.  (3.2)  we  obtain 

K-\ 

eNj  +  eNNgNj(z)  =  y ^(ekj  +  ekN’gNj{z))gNk{z)  +  (ejry  +  eKNgNj{z))gm{z). 
k= 1 

Taking  j  —  AT  we  have 


AT-I 

+  emgmciz)  =  +  ew*w(r))*»(r)  +  (««  +  «jw®«r(*)  )*«(*). 

*=1 

Comparing  the  highest  coefficients  of  the  two  polynomials  we  have  =  eNN 
and  eks  =  0  for  any  k  =  1, . . . ,£.  Hence  vu{z)  =  eu  is  in  fact  a  constant  for  any 

\'J  =  1 . K-  Since  1  < degfevi )  <  deg(s.v2 )  <  deg {gNK),  l,gm,...,gNK  are 

linearly  independent.  By  Eq.  (3.2)  again  we  have 

K 

e* i  +  enNgNj{z)  =  y^vkjgNk(z)  for  j  = 

*=  i 

Hence  v„  =  e0  =  0  except  possibly  vn  =  en  =  em  for  j  =  1,2,. . .  ,K.  This  is 
exactly  what  we  need.  □ 

We  now  consider  the  general  canonical  form  (2.2)  G(z )  with  N  =  K  +  1.  We 
have  the  following  necessary  and  sufficient  conditions  for  all  possible 
(K  +  1 )  x  K  AR  matrices. 
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Theorem  5.  Let  G(z)  have  the  cononical form  (2.2)  with  N-K+  1. 

(a)  If  1  ^  deg  gm  <  deg  gm  <  •  •  •  <  deg  gNK,  then  G(z)  is  AR  if  and  only  if 
GCD{gKK,gm)  =  1  -In  this  case,  irreducibility  and  ambiguity  resistance  are  the 
same. 

(b)  If  gm(z)  =  •  •  •  =  g.v(i-i) (z)  =  0, 1  <  deg  gm.  <  •  •  •  <  deg  gm ,  then  G(z)  is 
AR  if  and  only  if  GCD(gjoc,g/v*r)  —  1j  0>g/aigK2>  •  •  •  :gK(i-\)>g\L.  •  ■  ■  ■  gw(A'-i)} 
are  linearly  independent  over  F,  and  W\  n  W2  —  {0},  where 

Wx  -  span{gATjc, gmgKi gmgw-i)}, 


W2  =  SP&n{gKK,gKKgK\,-  ■  ■  ,gKKgK(L-\),gKKgNL>  ■  ■  ■  >  gfCKgN(K-\)} i 
where  span  means  the  set  of  all  linear  combinations  with  constant  coefficients. 

Proof,  (a)  This  is  a  special  case  of  (b):  the  case  of  L  =  1.  If  GCD(gA«,  gm)  =  1 
and  1  ^deggAn  <  •  ■  •  <  degg/w,  we  can  easily  check  that 
span{gMr,gAflcgjci.  ■  •  •  ,gNtcgK(K-i)}  n  span{g^,g/xgm,-  •  •  ,g*Kgv(K-i)}  =  M- 
So  we  only  need  to  prove  (b). 

(b)  EG(z)=  G(z)V(z)  we  get  the  following  equations: 


eij  +  eucgKj(z)  +  eiNgNj{z)  —  Vij(Z)i  1 

-i, 

(3.3) 

K 

eg]  +  € KKgKjiz )  +  eKNgNj{z)  —  S)'JgKm(z)Vmj{z)  1 

m=  1 

(3.4) 

K 

e Nj  +  eNKgKj{z)  +  emgNj(Z)  =  'Sy^JgNm{z)Vmj{z), 

m—  1 

1  <./<*- 1, 

(3.5) 

eucgiaci2)  +  eiNgm(z)  —  Vik{z)i  1  ^  i  ^  K  —  1, 

(3.6) 

K 

eKKgKJ((z)  +  ^KNgNg(z)  —  ^  'jgKm  (z)  VmK  (z)  > 
m- 1 

(3.7) 

K 

^nkSkk{z)  +emgNic(2)  — 

(3.8) 

m~\ 


Substituting  Eq.  (3.6)  to  Eq.  (3.8)  we  obtain 

K~\ 

emgtctfz)  +  emgm(z)  =  ^gvm(z)(em^g^(z)  +  e^gm^))  +  Vm{z)gNK{z) , 
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-  Vkk{z)  -  =  [  y ^mKgNmjz)  -  eNK |to(z). 


So  GCD(gNKigKx)  =  1  implies 

^(Z)|  ('^2emKgNm{z)  -  6®]  . 


Hence  1  <  deg  gNL  <  ■  ■  ■  <  deg  gNK  implies  eNK  =  0  and  eiK  =  0  for 
/  =  I,. . .  ,K  -  1  and 

K-\ 

Vkk(z)  —  eNN  ~  (3-9) 

m~  1 

Plugging  Eqs.  (3.6)  and  (3.9)  into  Eq.  (3.7)  we  get 
—  'y^fmNgKm  (z)^  gNK  ) 

=  •  —  6/tff  +  ^~^(g mXgKm  ~  emNgNm{z))^J  gfcfc(z), 

or 

~  ^^mNgKm  (? )  ^  gSK  ) 

=  ( CNN  ~  CfCK  +  y^/mKgKm  “  X>  jto(z)-  (3.10) 

\  m=  1  m=i  / 


Now  ^  fl  =  {0}  and  the  linear  independence  of 
•  •  •  igK(t-\),gNL,  •  •  •  ?&v(i:-i)}  mean  eKK-eNN,  eiK=0  for 
/  =  1 , . . . ,  L  -  1 ,  emN  —  0  for  m  =  L, . . . ,  K  -  1 .  So 

K- 1  /.-I 

“  'S)'emNgKm{z)  =  ^KA’  ~  ^^mA !gKm{z)  =  0 
/n=1  m= 1 

implies  =  0,  emN  =  0  for  m  =  1 , . . . ,  L  -  1  by  the  linear  independence  of 
{ 1  ,gK\ i * •  •  > £jr(i-i)}.  Hence  we  obtain  eiN  =  0  for  i  =  1,2,. ...AT,  =  0, 

=fiw,  eIK  =  0  for  i  =  1, 2, . . . ,  K  -  1.  Then  Eq.  (3.3)  becomes 

Vjj(z)  =  etj,  i,j  =  1 , 2, . . . ,  K  -  1 , 


(3.11) 
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Eq.  (3.4)  becomes 

eK j  +  eKKgKj{z)  =  ^2gKm(z)vmJ(z),  1  ^j^K-  1,  (3.12) 

m=  1 

Eq.  (3.5)  becomes 

* 

eNj  +  emgNj(z)  =  ^&V*(z)MZ)  =  YjNm{z)vmj{z), 

m= 1  m=L 

1  AT  -  1.  (3.13) 

Plugging  Eq.  (3.11)  into  Eq.  (3.13)  we  get  e m  =  eyy  =  uyy,  t>,y(z)  =  e,y  =  0  for 
i  =  L,...,K,  j  -  1,2 ,...,AT  -  1,  i  ^ /  1°  this  case  Eq.  (3.12)  becomes 

Z.-1  A 

eKj  +  €KKgKj{z)  =  y^gAw(^)g/n>  + 

/n=l  m=Z, 

This  means  that  e,*,  =  0  if  /  =  1, . . . ,Z  —  1,  y  =  1,2, . . . , K  -  1,  i  ^j.  This 
proves  that  E  =  en/*+j,  V(z )  =  eu/*. 

We  now  prove  the  necessity.  If  G(z)  is  AR,  it  is  obvious  that  we  require 
GCD(g**,to)  =  1  and  {l,gKU  •  •  •  ,£a(i-i)}  are  linearly  independent.  Now  if 
fFj  n  FF2  /  {0},  Eq.  (3.10)  implies  that  we  can  find  non-trivial  solution,  i.e., 
there  exists  elK  /  0  for  some  1  <  i  <Z,  —  1.  Hence  we  conclude  that  V(z)  ^  cdK . 
This  contradicts  with  the  AR  property  of  G(z).  □ 

Remark  4.  In  Theorem  5,  if  gKx(z)  =  1  and  gKj{z)  =  0  for  1  K  -  1,  it  is 
exactly  Theorem  4. 

By  Theorem  5,  we  can  also  see  that  if  the  field  F  is  the  complex  field  C,  and 
the  degree  of  the  polynomial  matrix  of  size  (K  +  1)  x  K  to  be  bounded  by  some 
integer  M,  then  the  set  of  (K  +  1)  x  K  polynomial  matrices  that  are  not  AR  has 
measure  0  in  the  finite  dimensional  linear  space  consisting  of  all  (K  +  1)  x  K 
polynomial  matrices  whose  degrees  are  bounded  by  M.  This  means  that  a 
randomly  generated  polynomial  matrix  is  AR  with  probability  1.  Hence  we 
have  the  following  result  which  confirms  the  conjecture  made  in  Ref.  [10]. 

Corollary  1.  A  randomly  generated  (AT  +  1)  x  K  polynomial  matrix  over  the 
complex  field  C  is  almost  surely  AR. 

The  following  corollary  gives  an  intuitive  construction  of  a  family  of  AR 
matrices. 
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Corollary  2.  If  gn i (z)  =  •  •  •  =  gN(L-i)(z)  =  0,  1  < deg gm.  <■••  <  deg gNK, 
GCD(gKK,gNK)  =  l,  {\,gKUgKi,  ■  •  ■  i8k(l-\)iEnl,  ■  •  •  ,^jv(a:-i)}  are  linearly  in¬ 
dependent  over  F  and  if  deg  gsx  >  deg  ggj  for  1  —  1,  then  G(z )  is  AR. 

Proof.  Let 

W\  =  span {gm,gmgK\,-  •  • , ^a«t^a:(a:-i)}? 

^2  =  span{to,  togATI,  •  •  *igKKgK{L-l)igKKgNL>  •  ■  •  ,  £xk£V{*-1)  }• 

We  only  need  to  prove  W\  n  W2  =  {0}.  Now  let 

K-\  L- 1  K- 1 

'Y'zjgNKgKj  =  YfjgKKgKj  +  y ^PjgKKgNj 
j—  i  J=L 

i.e., 

^  f  =  to  ( Y,pjgKJ + Ypjgvj 

\j=\  )  \j=  \  j=L 

By  GCD(gKK,gNK)  =  1  we  get 

(£-i  *-t 

'Y^PjSV  +  YlPigNi 

Ml  ML 

So  deg  >  deg  gKj  and  deg  gNL  <  ■  ■  ■  <  deg  gNK  induce 

Eta + Eta  =  o 

j=\  j=L 

and  hence  W\  D  W2  =  {0}.  □ 

It  is  natural  to  ask  the  following  question:  if  G{z)  of  size  N  x  K  (N  >  K)  is 
AR,  H(z)  is  an  M  x  K  polynomial  matrix,  is  polynomial  matrix 

G(z)' 

H{z). 

AR?  The  following  example  provides  a  negative  answer  to  this  question. 

Example  1.  Let 


By  Theorem  4  we  see  that  G(z)  is  AR.  Let 
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G(z) 


'1  O' 

0  1 
z  z2 
.0  z  ^ 

We  can  easily  check  that 


"1 

0 

-1 

O' 

'1 

O' 

'1 

O' 

1 

1 

0 

1 

0 

1 

0 

I 

0 

0 

1 

0 

z 

z2 

z 

z2 

.0 

0 

1 

1. 

.0 

z . 

.0 

z . 

—  z  — z2 

1  1  +z. 


Hence  G(z)  is  not  AR. 

However,  we  have  the  following  property  [6], 

Proposition  2.  If  an  M  x  K  polynomial  matrix  A(z)  is  AR-equivalent  to 

■G(zY 

0 


and  G(z)  is  AR,  then  A(z)  is  AR. 


Proof.  We  only  need  to  prove  that  if  G(z)  is  AR  of  size  N  x  K,  then 

'  G{z)  ‘ 

0 

is  AR.  By  equation 


E\\  En 

G(z) " 

G(z)‘ 

Er\  E22 . 

0 

0 

we  get  EuG(z)  =  G(z)F(z),  so  G(z)  is  AR  concludes  V(z)  =  aJK  for  some 
nonzero  constant  a.  □ 


In  Section  3  we  completely  characterized  (K  +  1)  x  K  AR  matrix.  However, 
the  sufficient  conditions  for  general  N  x  K  polynomial  matrices  to  be  AR  are 
not  yet  clear.  Another  interesting  question  is,  if  G(z)  is  an  N  x  K  (N  >  K  +  1 ) 
AR  matrix,  can  we  always  find  an  ambiguity  resistant  (K  +  1)  x  K  submatrix 
H(z)  among  the  AR-equivalence  class  of  G(z)? 

Finally,  as  pointed  out  by  one  of  the  referees,  some  of  the  results  in  this 
paper  also  apply  to  finite  fields.  For  instance,  let  us  consider  Lemma  3.  Since 
the  degrees  of  f  for  1  <  i  <  L  are  bounded,  using  a  simple  non-topological 
counting  argument,  Lemma  3  is  also  true  for  a  sufficiently  large  finite  field. 
Since  Lemma  3  plays  a  main  role  in  the  proof  of  Theorems  2  and  3,  we  believe 
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that  if  the  degrees  of  polynomial  matrices  are  bounded  and  the  matrix  size  is 
fixed,  Theorems  2  and  3  is  also  true  for  a  sufficiently  large  finite  field.  Never¬ 
theless,  we  think  that  the  results  in  Theorems  2-5  may  not  hold  for  general 
finite  fields  when  there  is  no  restriction  on  polynomial  matrices. 
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which  compeltes  the  derivation  with  T  ‘  L::)  =  V._  /  - 

L  L  ,  L 

Equation  <I4i  indicates  that  the  error  made  in  approximating  the 
22  block  in  L  is  proportional  to  A’  L::  i:  however.  T  is  a  function 
of  L:_  through  *»  and  the  I  -norms  of  P  and  /  -  L,  .  L," ;  L  _D 
To  partiallv  examine  the  behavior  of  T.  we  argue  that  decreasing 
A  L:;  in  a  natural  manner  decreases  this  quantitv  as  well  Assume 
that  0  is  fixed  and  that  A  is  decreased  bv  decresing  the  value  of 
the  k  parameter  in  the  prior  model,  specifically  for  the  coefficients 
in  the  22  block  Now.  it  is  not  hard  to  show  that  D~  1  — *  0  so  that 
with  D  =  L  L\: 

\\l-BD-'\U  =||  I-BD~'  +D-' -D-'  ||, 

<||/-D-'||,  +110-'  -BD-'ll. 

+  iizr'iui/  -  Bill  — °||/||.  =  i. 

Hence,  asvmptoticallv.  ||/  -  BD~X  ||i  is  independent  of  L._.  Re¬ 
ferring  to  (8b).  it  is  not  difficult  to  show  that  as  n  decreases, 
P ..  —  k'P.j  J:.  where  Po  „*j  is  the  22  block  of  the  appropriately 
permuted  form  of  Pv.  Therefore,  as  k  —  0,  both  o  and  ||Pj.»||  i  go 
to  0  Thus,  we  conclude  that  decresing  A ( L>: )  by  vary  ing  the  degree 
of  regularization  will  cause  T  —  0. 
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Efficient  Implementation  of  Arbitrary-Length 
Cosine-Modulated  Filter  Bank 

Xiqi  Gao.  Zhenva  He,  and  Xiang-Gen  Xia 


Abstract —  The  fast  implementation  of  arbitrary  -length  cosine- 
modulated  filter  bank  is  investigated.  Bv  using  the  linear  phase  propern 
of  the  prototype  filter,  a  more  efficient  implementation  structure  is 
obtained  for  the  filter  bank.  In  the  new  implementation.  2x2  lossless 
lattices  are  used  instead  of  2  x  I  ones  in  the  traditional  implementation 
with  the  number  reduced  by  half. 

I.  Introduction 

The  cosine -modulated  filter  bank  (CMFB)  has  received  much  inter¬ 
est  in  recent  years  [  I  J— {6].  It  has  two  remarkable  features:  easy  design 
and  fast  implementation.  While  the  design  of  CMFB's  has  been 
addressed  by  many  researchers,  we  deal  with  the  implementation  of 
paraunitary  CMFB's  in  this  correspondence.  Typically,  the  polyphase 
component  matrix  of  a  paraunitary  CMFB  can  be  expressed  as  the 
product  of  a  modulation  part  and  a  polyphase  part  in  terms  of  the 
polyphase  components  of  the  prototype  filter.  Based  on  such  an 
expression,  the  CMFB  can  be  implemented  through  two-channel 
lossless  lattices  and  fast  discrete  cosine/sine  transform  (DCT/DST) 
algorithms  (see,  for  example,  [1]  and  (4]).  Two-channel  lattices  are 
often  used  for  an  ^/-channel  CMFB.  Notice  that  only  half  the  number 
of  the  lattices  are  required  in  the  implementation  of  Malvar’s  CMFB. 
which  is  called  extended  lapped  transform  (ELT)  [2].  The  motivation 
of  this  correspondence  is  to  generalize  the  above  Malvar's  result  to 
other  paraunitary  CMFB's.  The  arbitrary-length  CMFB  developed  by 
Nguyen  and  Koilpillai  in  [3]  is  considered  in  this  correspondence. 

This  correspondence  is  organized  as  follows.  In  Section  II,  we  first 
review  the  arbitrary-length  CMFB  briefly.  Then,  we  show-  that  the 
four  filters  in  two  related  pairs  of  power  complementary  polyphase 
components  of  the  prototype  filter  form  a  2  x  2  paraunitary  system 
due  to  the  prototype  filter  symmetry.  In  Section  III,  a  new  expression 
of  the  polyphase  component  matrix  of  the  CMFB  is  developed.  Based 
on  it,  a  more  efficient  implementation  structure  is  obtained  by  using 
the  2  x  2  lossless  lattices  instead  of  the  2  x  1  ones  in  the  traditional 
implementation.  The  implementation  complexity  of  the  CMFB  is 
discussed  in  Section  P/. 

Notations :  Capital  and  lower  case  letters  are  used  to  denote  the 
transfer  functions  and  the  impulse  responses  of  fillers,  respectively. 
Bold  letters  indicate  vectors  and  matrices.  The  functions  |V|  and 
[j-J  round  the  value  of  j-  to  the  nearest  integers  toward  infinity  and 
minus  infinity,  respectively.  C1”  and  C1'  stand  for  the  standard 
DCT  matrices  as  defined  in  [8].  0  stands  for  matrix  whose  entries 
arc  all  zeros.  /  v  and  J.\  arc  the  .V  x  .V  identity  and  reverse  identity 
matrices,  respectively. 


Manuscript  received  August  7.  1997;  revised  August  15.  1998.  This  work 
was  supported  in  pan  by  Natural  Science  Foundation  of  China.  The  associate 
editor  coordinating  the  review  of  this  paper  and  approving  it  for  publication 
was  Dr.  Sergios  Theodoridis. 

X.  Gao  and  Z.  He  are  with  the  Department  of  Radio  Engineering,  Southeast 
University,  Nanjing,  China. 

X.-G.  Xia  is  with  the  Department  of  Electrical  Engineering,  University  of 
Delaware,  Newark,  DE  19716  USA. 

Publisher  Item  Identifier  S  I053-587X(99)02 163-7. 


1053-587X/99S  10.00  ©  1999  IEEE 


IEEE  TRASS  ACTIONS  ON  SIGNAL  PROCESSING.  VOL  47.  NO  4.  APRIL  l*W 


I  MW 


* - .  - ;  x  c/m  rz —  ; - • 

0(Z»r - «  *M  - H  T M  - HFo'Z)r 
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both  of  the  two  filters  must  be  delays.  In  nunie  l>  and  >L  Gi  ( : :  and 
G  /  *  4  f ; ;  have  different  lengths.  If  the>  are  related  to  themselves 
b\  r 2 1.  they  are  under  mode  «/:  otherwise.  they  are  under  mode  l>.  In 
mode  ' I .  one  of  the  two  filters  must  be  a  delay,  and  all  coefficients 
of  the  other  one  must  be  zeros. 


- - 1  r - -  xM  , (m)  — |  -  | 

1 - •fry.czj - «j  ;m  |— : — — 1  tM  • — - 1 

F»g  !  .W -channel  maximally  decimated  filter  bank. 

II.  The  Arbitrary-Length  CMFB 

A  A  Review  of  The  Arbitrary-Length  CMFB 

A  typical  37 -channel  maximally  decimated  filter  bank  is  shown 
in  Fig.  1.  where  Hl  (:)  and  Fkiz){  0  <  A-  <  M  -  1 )  are  the 
transfer  functions  of  the  analysis  and  synthesis  filters,  respectively. 
At  the  analysis  side,  the  input  signal  j*(/0  is  decomposed  into  37 
subband  signals  through  the  bank  of  analysis  filters  followed  by  37- 
fold  decimators.  At  the  synthesis  side.  M  subband  signals  are  passed 
through  A/ -fold  interpolators  and  recombined  into  the  reconstructed 
sienal  s  n  ■  by  using  the  bank  of  synthesis  filters. 

Let  h  n  !  denote  the  impulse  response  of  a  linear-phase  low-pass 
prototv  pe  filter  with  length  A  =  2/no  37  +  mi.  where  mo  and  m  \ 
are  integers  and  0  <  m ,  <  237  -  1.  The  37-channel  arbitrary -length 
CMFB  is  defined  as  (3) 


B  Latin  e  Stem  lure  for  a  Power  Complementary 
Pair  and  Its  Related  One 

The  power  complementary  filter  pair  G\  < :  i  and  G  t  ~i( :  sat¬ 
isfying  (3)  can  be  completely  factored  as  the  two-channel  lossless 
lattice 

/TT7  f  6  *  (  :}  1 


[G  ,/^i  i  :}  J 


Rk  /  — 


Ck.1  *4/ 

/  —a./ 


At:)  = 


=  CO>0*  /. 


/  =  0.1.2.- 


where  l .  "  and  /*'»).  0  <  A  <  3/  —  1.0  <  n  1  A  1* 
are  the  impulse  responses  of  the  Ath  analysis  and  synthesis  filters, 
respectively  The  CMFB  is  exactly  the  one  investigated  in  [1]  with 
the  length  extended  to  arbitrary  integer  value. 

Suppose  that  the  impulse  response  of  the  lowpass  prototype  filter 
is  symmetry,  then.  /*  i  n )  is  the  time-reversed  and  shifted  version 
of  hk  i,  This  relation  means  that  the  CMFB  is  paraunitary  if  and 
only  if  it  has  perfect  reconstruction  property  [9].  Let  Gt(:).A*  = 
0.1.23/  -  1  denote  the  type-1  polyphase  components  of  the 
prototype  filter  [9]  Due  to  the  symmetry  property-  of  h[n).  Gk{z) 
satisfies 

/  :"*‘G„11-i-*C).  A*  <  m .  —  1 

Gi<;>  =  < 

[  :  ‘  G  jA/  +  m  j  -  l  -  l  (  •)•  *  d  m  I 

where  Gk  < : )  =  Gi  ( :  ’ 1 ).  It  has  been  shown  in  [3]  that  the  necessary 
and  sufficient  condition  on  the  polyphase  components  for  perfect 
reconstruction  is 

Gk  I :  )Gk ( - )  4-  G,\i+ki  -  )G.\/4-t(  - ) 

=  Aj-  0<k  <  M  —  1.  (3) 

This  means  that  appropriate  pairs  of  the  polyphase  filters  are  power 
complementary.  Depending  on  the  lengths  of  the  two  filters  G*(c) 
and  G\t+k(z)  and  the  relationship  between  them,  four  classes  of 
power  complementary  pairs  can  be  distinguished  in  the  general  case 
for  arbitrary  length  prototype  filters.  The  condition  given  by  (3)  can 
be  satisfied  by  the  four  different  modes,  as  discussed  in  [3].  In  modes 
a  and  c.  the  two  filters  have  the  same  length.  If  they  are  related  by  (2), 
thev  are  under  mode  r;  otherwise,  they  are  under  mode  a.  In  mode  c. 


and  m  depends  on  the  lengths  of  the  two  filters.  For  the  case 
.V  =  2m 3/,  all  the  polyphase  components  have  the  same  length,  and 
there  is  no  restriction  on  any  angle  parameter  0*-./.  For  the  general 
case  when  the  prototype  filter  has  arbitrary  length,  there  are  different 
constraints  on  the  angles  of  the  lossless  lattices  corresponding  to  the 
four  modes  (see  (31  for  details). 

For  each  power  complementary  filter  pair,  we  can  find  a  related 
one  due  to  the  prototype  filter  symmetry  of  (2).  The  four  filters  in 
the  two  pairs  define  a  2  x  2  system.  We  define  the  following  three 
types  of  2  x  2  systems  in  terms  of  different  m  i  and  A*: 

rtii,  *  /TT?\  Giiz)  G.\/  +  mi-i-t( :) 

m  i  <  3/.  A*  <  m  i  —  1  or 

w\  >  3/.  m  i  -  3/  <  A-  <  37  —  1  (5a) 


l':' o  =  vrn 


wii  <  37. 


G.\/  +  n»j  — 1  —  k(  -  ) 

)  —  GiAZ  +  m,  -  |  -  4  (  C 

m  i  <  A*  <  3/  —  1 


j  Gk{z) 

G u+kiz )  -Gm,  _  i  : ) 

mi  >  37.  k  <  mi  —37  —  1. 


is  for  the  polyphase  filters  under  mode  b  and  mode  d .  L{^] 
and  L^3)  are  for  the  polyphase  filters  under  mode  a  and  mode  c.  It  is 
easy  to  show  that  these  systems  are  paraunitary  and  can  be  expressed 
as  the  following  2x2  lattices: 

f[°.  jWm0-iA(;)---.Rt.iA(:) 


L'i"{:)  =  l  "*°]0 

\(-:)Rk.m0\(:)- Rt.iMORki 

[l  0  ]  _ 

0  {  — *  t*4r0 

L[2\z)=Rk  mo-lMz)Rk  mo-2A(;) 


8k  m0  — 


04.0  =  0. 


*  Rkj\{z)Rk 


[1  0 
°[o  (-ir° 


£|r,,(;)=Jl».moA(;)JI».mo-iA(:) 


■Rk.i\(:)Rk.o 


1  0 
0  (_l)m0+! 
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In  mode  r  and  mode  d ,  the  corresponding  lattices  are  trivial. 

In  practice,  a  two-channel  lossless  lattice  can  be  implemented  by 
using  the  two-multiplier  structure  for  each  section  |2|.  (9).  A  2  x  I 
lossless  lattice  with  m  free  angle  parameters  and  others  set  to  be  zero 
or  n/2  can  be  implemented  by  using  2m  multipliers  and  2un  -  h 
adders.  For  the  corresponding  2x2  system,  both  the  two  numbers 
are  2m  +  1.  For  Lk  and  Lk~  ,  there  are  nn,  free  angle  parameters, 
and  hence,  2 n>{)  +  1  multipliers  and  2 m,,  +  1  adders  are  required.  For 
Lk  \  there  are  m0  4-  1  free  angle  parameters,  and  hence.  2mxl  4-  3 
multipliers  and  2m0  4*  3  adders  are  required. 

IB.  Fast  Implementation  of  the  Arbitrary-Length  CMFB 

Now,  we  consider  the  implementation  of  the  CMFB.  Considering 
the  relationship  between  the  synthesis  bank  and  the  analysis  bank  of 
the  paraunitary  filter  bank,  we  only  deal  with  the  latter.  The  polyphase 
component  matrix  of  the  analysis  bank  can  be  expressed  as  [3] 


r 


where  D  is  an  M  x  A/  diagonal  matrix  with  the  At h  diagonal 


component  dL  =  i 


c  =  < 

f  C"!. 
Ic'.v 

/.  = 

/,  = 

[  [0 

0 

D  =  < 

jo 

J 

l 

[0 

[0  0 

r  °  o 

/ /0  -  M  0 
0  0] 
oj- 

0  1 

o  o  r 


h  =1) 

/.  =  1 
In  <  M 

lo  >  A/ 

lo  <  M 

lo  >  \L 


/,  =  0 
/i  =  1 


Substituting  (8)  into  (7),  we  obtain  the  following  expression  of 
El:): 


E(:)  =  DCG(:) 


■c%  /  =2( 


*k  +  o.5) 
M 


g0'  :}  =  diag(G3(  : )  Gi(r) 
g , < ;  •  =  diag< G\/( c)  G.\/-m(-) 


G.\/-iU)),  and 

G 2  A/  —  I  (  -  )  )  - 


Based  on  (7)  and  the  power  complementary  condition  in  (3)  for 
perfect  reconstruction,  the  filter  bank  can  be  implemented  through 
a  parallel  bank  of  2  x  1  lossless  lattices  cascaded  by  the  modulation 
matrix  The  number  of  the  2  x  1  lattices  is  equal  to  the  number 
of  sutvhannels  \f  The  modulation  part  can  be  implemented  by 
fast  DCT  algorithm.  Such  an  implementation  structure  has  been 
widely  used  in  the  CMFB’s  with  .V  =  2mA/  (1],  [4].  The  linear- 
phase  property  of  the  prototype  filter  is  not  exploited  to  reduce  the 
complexify. 

Let  m  t  >  A/  -  1  =  2/0  -  / 1 ,  where  lo  is  an  integer,  and  /  j  €  { 0. 1 } . 
If  l i  is  equal  to  zero,  one  of  m  j  and  A/  is  odd,  and  the  other  even. 
If  / 1  is  equal  to  one.  both  mi  and  A/  are  odd  or  even.  By  using  the 
properties  of  cosine  function,  the  modulation  matrix  can  be  expressed 


vTu  DC\A\ o  -  BA ,  AAi  +  BA0 


G(c)  =  vTa7[(AAo  -  BA i  )g0(-:2) 

+  :-'(A.\l+B.\0)gl(-:2)}.  (10) 

Based  on  this  new  expression,  the  analysis  bank  can  be  imple- 
mented  more  efficiently.  The  implementation  structure  is  shown  in 
Fig.  2.  The  diagonal  matrix  D  only  changes  the  signs  of  the  output 
subband  signals.  The  matrix  C  is  the  type  ID  DCT  and  type  IV 
DCT  for  /i  to  be  zero  and  one.  respectively.  It  can  be  shown  that 
the  A/  x  A/  matrix  Giz )  can  be  implemented  through  a  parallel 
bank  of  2  x  2  lossless  lattices  that  are  related  to  and  some 
delays.  To  give  the  explicit  formula  of  G{z)  and  see  this  clearly, 
eight  cases  can  be  distinguished  in  terms  of  /o . / 1  and  mi ,  as  shown 
in  Table  I.  In  Table  I,  we  give  the  numbers  of  the  2  x  2  lattices  used 
in  the  implementation  of  G{z)  for  the  eight  different  cases.  Here,  we 
consider  Case  2  as  an  example.  In  this  case,  1  <  m,  <  A/  -  1.  G{ :) 
takes  the  form  of  (11),  shown  at  the  bottom  of  the  page,  where  G* 
stands  for  v/2A/ G*(  -;2).  G{:)  can  be  implemented  in  parallel 
through  the  following: 

a  delay  2>/a7G/0(  —  t2)  and  another  possible  delay 
\/2  A/  [G(  m  t  -  m/2  ( -  - 2 )  +  -  ‘  G\f  +(mi  _j  )  /  2  (  — ^ )]  when 
A/  is  even; 


G\  z )  =  v2A/ 

r  o  oo 
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" 1  G.\/  0  0 
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Fig  :  Ne<*  implementation  structure  of  the  Af-ci'annei  Chfm  uj  ^  ^Jo^sfla^es  in  addit^n  to  some  delays.  C  is  the  standard 
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u)  a  sei  of  2  x  2  parauinary  systems 

r— -[  (7i(-:2)  Gu* 

L*<:»  =V2A/|  lU .2, 


*4  —  1  -lr<  — ^ 


Lit:*  ='/,-u[;-icu^(-:a)  -s  'GaAf+m,-!-*!--  )J 

=  (,2) 

vs  here  t»i  <  <  I  A/  +  mi  —  3)/2, 

iii )  another  set  of  2  *  2  parauintary  systems  [see  <  1 3)  at  the  bottom 

of  the  page); 

where  0  <  k  <  lm,/2J  -  1-  Two  types  of  the  2  x  2  paraunitary 
systems  defined  in  Sect.on  n  are  used  in  the  parallel  implementation. 
The  total  numbers  of  the  2  x  2  paraunitary  systems  are  ( M  -  m  i  - 
1  )/2  and  [mi/2J.  respectively. 

IV.  Implementation  Complexity 
In  the  previous  section,  we  have  shown  that  the  CMFB  can  be 
implemented  through  a  parallel  bank  of  2x2  lossless  Utttces 
and  some  delays  followed  by  the  standard  DCT.  For  the  DCT  s. 
fast  algorithms  are  available  [7).  (8).  All  the  lattices  are  related  to 


Lj'*./  =  1.2.3  without  additional  multipliers  and  adders  required  for 
the  implementation.  From  Table  1.  it  is  obvious  that  the  total  number 
is  less  than  or  equal  to  M/2  for  each  case.  The  implementation  cost 
of  the  CMFB  is  that  of  about  M/2  2x2  lattices  plus  one  .U-point 
DCT  matrix  working  at  an  .U-fold  decimated  rate. 

In  the  traditional  implementation  structure  of  an  A/ -channel 
CMFB.  the  number  of  the  2  x  1  lattices  is  M.  Ignoring  die  trivial 
lattices,  which  are  under  mode  c  and  mode  <i,  the  number  is  exactly 
twice  as  that  in  the  new  implementation  structure.  Since  only  one 
additional  multiplier  and  three  additional  adders  are  required  to 
implement  the  corresponding  2x2  lattice  of  a  2  x  1  one,  the 
complexity  of  implementing  a  set  of  2  x  2  lattices  is  lower  than 
that  of  doubled  2  x  1  ones.  When  the  section  numbers  are  large, 
the  complexities  of  the  two  type  lattices  are  approached,  and  hence, 
the  cost  can  be  saved  nearly  one  half  to  implement  a  set  of  2  x  2 
lattices  instead  of  doubled  2x1  lattices. 

As  an  example,  we  consider  Case  5,  with  SI  —  2  to  see 
the  efficiency  of  the  new  implementation  structure.  In  this  case. 


_ _ [r'Gu+tl-:1)  Gm,-i-*(-c2)  1  _  [0  ;l|,(»,_c2)V(c) 

v^7|-  Gk(l;2)  — :»)l  "  ll  Ol^  (  lM 


*  G\i  +  m  - 


,(— c)R4.m0-tA(-:**fl‘ Rk .,A(-:2)R*.o|*0  (-i)n,o+l  ]‘ 


&k.  o  =  0. 
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no: 


TABLE  I 

Nl  MRFRS  or  L ATTIC  f*  is  THI  NtW  IMPLEMENTATION  StRI’CTVRF 


Case 

|  Number  of  lattices 

L,“ 

L;' 

L.’ 

1 )  6-0.  /,-  <  \f  and  m,  =  0 

0 

L  U  ’  2J 

0 

2).  /  =0.  /n  <  S4  and  m]  *  0 

.*  -J 

(  M  -  m,  i  ‘  2_ 

0 

3).  /,  =0.  /0  =  .\/ 

<  t/-l)/2J 

0 

0 

4).  /  =0.  /0  >  M 

W-  m;/2J 

0 

,<m,  -  .4/  -  1>  /  2 

5).  /,  =1.  /„<.»/  and  m,  =0 

0 

LW/:J 

0 

6).  /,  =1.  l0<\1  and  m,  *  0 

m./2j 

L(  A#  -  m, ) '  2J 

0 

7).  /,=!.  /„  =  M 

0 

0 

8).  /,=!.  /„  >  M 

0 

[(in,-  M- l)/2 

<b> 


Fit  }  C  'mpuuiR'nj)  ,omp!e\i!\  in  case  5  with  A/  =  1C  t a >  Number  of 
multiplication*  per  input  \umple.  ib>  Number  of  addition*  per  input  sample. 
Here  +  and  •  represent  the  complexities  of  the  traditional  and  the  new 
implementation  structures,  respectively. 


m  i  is  equal  to  zero.  M/2  lattices  L\~'iz).  and  type  IV  DCT  are 
used  B>  using  the  fast  algorithm  presented  in  [8],  ( M/2 )  log  ,  M  + 
\I  multiplications  and  i3.U/2)  lug_»  M  additions  are  required  to 
compute  the  M -point  DCT-IV.  The  total  cost  of  implementing  the 
CMFB  is  M  2m  4  3  4  log_,  M)/2  multiplications  and  Mi2ni»  4- 
1  4  3 log,  Mi/2  additions  per  M  input  samples.  It  is  the  same 
cost  required  by  Malvar  s  ELT  with  the  same  length  [2|.  In  the 
traditional  implementation.  M <  4m0  +  2  4-  log_,  M)/2  multiplications 
and  A/(4mi0  +  31ogJA/)/2  additions  are  required  for  M  input 
samples  (l|.  |9|  In  Fig.  3,  we  plot  the  average  numbers  of  the 
multiplications  and  additions  per  input  sample  (MPIS  and  APIS) 
versus  with  M  -  1G.  It  can  be  seen  that  by  using  the  new 
implementation  structure,  the  saving  of  operations  becomes  more 
significant  as  increases. 
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V.  Conclusion 

A  more  efficient  implementation  structure  for  a  class  of  paraunitary 
CMFB  with  arbitrary  length  has  been  developed  in  this  correspon¬ 
dence.  The  linear-phase  property  of  the  prototype  filter  is  exploited 
to  reduce  the  implementation  cost.  The  new  implementation  structure 
uses  2x2  lossless  lattices  instead  of  2  x  1  ones  with  the  total  number 
of  lattices  reduced  by  half.  The  implementation  costs  are  significantly 
saved,  especially  for  the  CMFB  with  a  large  ratio  of  the  length  to 
the  number  of  channels. 
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Abstract 

A  quantitative  analysis  is  given  for  the  signal-to-noise  ratio  (SNR)  in  the  short-time  Fourier 
transform  domain  for  multicomponent  signals  in  additive  white  noise.  It  is  shown  that  the  SNR  is 
increased  on  the  order  of  0{N/K)  where  K  is  the  number  of  components  of  a  signal,  N/T  is  the 
sampling  rate,  and  T  is  the  window  size.  The  SNR  increase  rate  is  optimal  for  given  K.  For  this 
result,  the  SNR  definition  is  generalized,  which  is  suitable  for  signals  not  only  in  the  time  domain 
but  also  in  other  domains.  This  theory  is  illustrated  by  one  numerical  example. 

1  Introduction 

Time-frequency  analysis  [11-12]  has  become  an  important  technique  in  analyzing  wideband/nonstationary 
signals  in  various  applications  including  inverse  synthetic  aperture  radar  (ISAR)  imaging  [1],  biomedi¬ 
cal  signal  analysis  [2-3],  speech  signal  analysis  [4],  and  FM  radio  communications  [5].  One  of  the  most 
important  features  of  this  technique  is  that  it  usually  increases  the  signal-to-noise  ratio  (SNR)  in  the 
joint  time- frequency  (TF)  domain.  This  is  particularly  advantageous  for  signals  which  are  difficult  to 
detect  in  the  time  or  frequency  domain  alone.  The  reason  for  this  important  feature  can  be  stated 
as  follows.  A  joint  TF  transform  usually  spreads  noise  from  one  dimension  (the  time  or  frequency) 
into  two  dimensions  (the  joint  time  and  frequency)  while  it  usually  concentrates  a  signal  in  localized 
regions  in  the  TF  plane.  A  number  of  research  results  on  the  estimation  of  time-varying  frequencies 
have  appeared,  such  as  [5-7]  with  Wigner-Ville  distributions.  However,  to  the  author’s  best  knowledge, 
there  does  not  exist  a  quantitative  analysis  for  the  SNR  increase  for  any  joint  time-frequency  transform, 
which  is  certainly  an  important  issue  in  practical  applications  in  signal  detection  by  using  thresholding. 

In  the  conventional  SNR  definition,  the  mean  power  is  taken  over  the  whole  domain  of  a  signal.  If 
the  signal  is  stationary  in  this  domain,  this  definition  works  fine.  However,  if  the  signal  is  not  stationary 
in  this  domain,  such  as  a  single  tone  signal  in  the  frequency  domain,  this  definition  is  no  longer  suitable. 
In  this  correspondence,  we  first  generalize  the  SNR  definition  so  that  it  is  not  only  suitable  for  signals  in 
the  time  domain  but  also  in  other  domains,  such  as  the  frequency  domain  and  the  joint  time-frequency 
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domain.  We  then  present  a  quantitative  analysis  of  the  SNR  increase  rate  in  the  joint  time-frequency 
domain  for  the  short-time  Fourier  transform  with  rectangular  windows,  where  multicomponent  signals 
in  additive  white  noise  are  considered.  The  main  result  can  be  stated  as  follows.  Let  K  be  the  number 
of  monocomponents  in  a  signal,  T  be  the  window  size  for  the  short-time  Fourier  transform,  and  N/T 
be  the  sampling  rate.  N  point  discrete  Fourier  transform  is  performed  in  each  window.  Then,  the  SNR 
in  the  joint  time-frequency  domain  is  increased  on  the  order  of  0(N/K),  when  the  window  size  T  is 
small  enough. 

This  correspondence  is  organized  as  follows.  In  Section  2,  we  formulate  a  proper  definition  for  SNR 
in  different  domains.  In  Section  3,  we  present  the  proposed  quantitative  approach  to  analyze  the  SNR 
increase  rate  in  the  joint  time-frequency  domain.  A  numerical  example  is  presented  in  Section  4  to 
illustrate  the  proposed  approach. 


2  Signal-to-Noise  Ratio  in  Different  Domains 

The  conventional  signal-to-noise  ratio  (SNR)  is  defined  as  the  ratio  of  the  mean  power  of  the  signal  over 
the  mean  power  of  the  noise,  where  the  mean  is  taken  over  the  whole  time  domain.  It  is  formulated  as 
follows.  Let  y[n]  be  a  distorted  signal: 

y[n]  —  x[n]  +  ri[n],  0  <  n  <  N  —  1,  (2-1) 


where  x[n]  is  a  signal  and  7 ][n]  is  an  additive  white  noise  with  variance  a2.  The  SNR  is  defined  as: 


SNR  = 


En=0X  1*M12 

No2 


(2.2) 


This  SNR  is  used  quite  often  in  describing  the  noise  level  relative  to  the  signal,  and  in  distinguishing 
the  signal  from  noise  in  stationary  environments.  When  the  SNR  is  too  low,  in  general  it  is  impossible 
to  distinguish  the  signal  x\n\  from  y [n] .  However,  for  some  special  kinds  of  signals  x [n] ,  such  as  narrow 
band  signals,  it  is  possible  to  detect  the  signal  in  the  Fourier  transform  domain  even  when  the  SNR  is 
of  negative  dB.  An  example  is  shown  in  Fig.  1,  where  the  SNR=  -lldB  and  the  signal  x  is  a  single 
tone  signal. 

According  to  the  SNR  definition  in  (2.2),  an  orthogonal  transform  does  not  change  the  SNR,  i.e., 
the  SNR  in  the  transform  domain  is  exactly  equal  to  the  SNR  in  the  time  domain.  This  is  because 
of  the  energy  preservation  property  of  orthogonal  transforms.  This  implies  that  the  SNR  of  the  signal 
in  the  frequency  domain  in  Fig.  1(b)  is  still  — lldB.  However,  one  can  clearly  see  the  signal  in  the 
frequency  domain.  This  suggests  that  the  SNR  definition  in  (2.2)  is  not  proper  to  judge  the  possibility 
of  detecting  the  signal  in  the  frequency  domain  in  Fig.  1(b).  It  should  not  be  surprising  since  the  signal 
in  Fig.  1(b)  is  not  stationary  and  the  mean  power  over  the  whole  frequency  domain  is,  of  course,  not 
proper  to  the  signal  with  a  single  spike. 

The  above  observation  suggests  that  the  SNR  definition  is  transform  domain  dependent  and  should 
relate  to  the  bandwidth  of  a  signal  occupied  in  that  domain.  We  now  introduce  the  following  SNR 
definition  in  a  domain. 
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Suppose  the  expression  (2.1)  is  already  in  a  transform  domain,  where  n  is  the  discrete  variable  in  the 
transform  domain.  Assume  the  additive  white  noise  rj[n]  in  (2.1)  occupies  the  full  band  in  the  transform 
domain.  For  the  signal  x [n]  of  length  N,  0  <  n  <  N  —  1,  let 


B  =  {n  :  0  <  n  <  IV  —  1  and  |x[n]|2  >  0.5Q<nmx  ^  |x[n]|2}, 


(2.3) 


where  the  number  0.5  comes 
the  SNR  is  defined  as 


from  the  common  3dB  bandwidth  definition  in  communications. 


SNR  = 


Eng B  I^MI2 

’ 


Then, 

(2.4) 


where  \B\  denotes  the  cardinality  of  the  set  B.  Notice  that  this  definition  is  similar  to  the  SNR  definition 
in  communications,  where  the  signal  is  only  considered  in  its  bandwidth. 

One  can  clearly  see  that  the  SNR  in  (2.4)  is  always  greater  than  or  equal  to  the  SNR  in  (2.2) 
because  the  mean  in  (2.4)  is  only  taken  over  the  first  large  values  in  the  whole  domain.  With  the  SNR 
definition  in  (2.4),  the  SNR  in  the  time  domain  for  the  signal  in  Fig.  1(a)  is  -8.4dB  but  the  SNR  in 
the  frequency  domain  for  the  signal  in  Fig.  1(b)  is  16.3dB.  Although  about  2.6dB  SNR  is  increased 
over  the  original  definition  in  '(2.2),  the  SNR  in  the  frequency  domain  is  significantly  better  than  the 
old  SNR,  that  is  — lldB,  in  describing  the  signal  characteristics  over  the  noise.  The  time  domain  SNR 
increase  is  consistent  for  relatively  stationary  signals  without  dramatic  jumpings  in  the  time  domain. 


3  Signal-to-Noise  Ratio  in  the  Joint  Time-Frequency  Domain 

In  this  section,  we  analyze  the  SNR  in  the  joint  time-frequency  domain  for  the  short-time  Fourier 
transform,  where  the  SNR  defined  in  (2.4)  is  used.  In  order  to  do  so,  we  first  describe  a  multicomponent 
signal  model. 

3.1  Multicomponent  Signal  Model 

Throughout  the  rest  of  this  paper,  we  use  the  following  multicomponent  signal  model, 

K 

y(t)  =  YlXkW  +  *?(*)>  0 ^  1  ^ T«>  (3-1) 

*= l 

where  we  have  the  following  assumptions 

(i)  t  is  the  continuous-time  variable  and  limited  in  the  finite  observation  interval  [0,To]. 

(ii)  r](t)  is  an  additive  white  noise  process  with  mean  0  and  variance  a2.  It  is  not  differentiable  at  any 
time  t  G  [0, To]  and  independent  of  £*(£),  1  <  k  <  K. 

(iii)  For  each  k,  1  <  k  <  K,  Xk{t)  is  a  monocomponent  time-varying  signal,  i.e., 

Xk(t)  =  A*(<)e*‘«,  (3.2) 

where  Ak(t)  is  the  slowly  varying  amplitude  envelope  of  z*(f),  and  4>k{t)  is  the  phase  of  Xk(t). 
The  magnitude  of  the  first  order  derivative  A'k(t)  is  upper  bounded  by  A *,  i.e.,  |A^(t)|  <  Ak 
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for  a  positive  constant  Ak,  and  the  magnitude  of  the  second  order  derivative  <?k(t)  is  also  upper 
bounded- by  fa,  i.e.,  \(f>'k(t)\  <  fa  for  a  positive  constant  fa  for  all  f  G  [0,T0]. 

(iv)  The  K  instantaneous  frequencies  <frk(t ),  1  are  distinct. 

Additional  details  on  multicomponent  signals  can  be  found  in  [8].  It  can  be  easily  shown  that  the 
process  y{t)  in  (3.1)  has  locally  stationary  behavior  [9-10]  in  the  following  sense 

I  Ryy(t  +  U,  S  +  u)  -  Ryy{t,  S)  |  <  C\ll\,  (3-3) 


for  a  positive  constant  C,  where  Ryy  denotes  the  autocorrelation  function  of  2/(0- 

As  a  remark,  the  nondifferentiability  assumption  (ii)  of  77(f)  makes  sense.  An  example  of  such 
processes  is  the  Wiener  process,  see  for  example  [13].  This  assumption  implies  that  any  sampled 
segment  of  77(f)  in  any  time  interval  is  a  white  noise  and  has  flat  Fourier  spectrum. 


3.2  Short-Time  Fourier  Transform  for  Multicomponent  Signals  and  SNR  Calcula¬ 
tions 

For  each  monocomponent  signal  xk(t)  in  (3.1),  by  (i)-(iii)  it  can  be  shown  that  there  exists  e*  >  0  such 
that,  for  any  s  €  (c*,To  -  ek), 

xk{s  +  f)  «  A*(s)e^(3)+^(s)t),  f  G  [-efc,e*], 

where  tiie  linear  term  A^(s)t  of  f  does  not  appear  because  of  the  “slowly  varying”  assumption  in  (iii)  on 
the  amplitude  envelope  A*(f).  Since  we  have  only  finite  many  monocomponent  signals  xk(t)  in  (3.1), 
there  exists  t  =  min{efc,  1  <  k  <  K}  >  0  such  that,  for  any  s  €  (e,T0  -  e)  and  any  k,  1  <  k  <  K, 

xk(s  +  f)  *  A*(s)e^(s)+^Wt),  f  G  [-e,  e],  (3.4) 


where  f  depends  on  the  constants  T0,  Ak,  fa,  1  <  k  <  K. 

With  (3.4),  at  each  time  s  €  (e,To  -  e)  we  apply  N  point  discrete  Fourier  transform  (DFT)  for  the 
signal  y(t)  for  t  €  (s  -  4-  f]  with  the  sampling  rate  N/T  for  T  =  2e.  For  convenience,  we  assume 

N  is  even.  The  DFT  is 

1  N I2  T21 

Py[m,l]  =  -j==  £  y{(m  +  q)-)e-^-,0<l<N-l,  (3.5) 

vN  q=-N/ 2+1 

where  m  is  in  the  range  such  that  (m  —  N/2  +  l)T/N  >  0  and  (m  +  N/2)T/N  <  To,  i.e., 


N  —  2 
2 


The  above  Py  can  be  decomposed  into 

Py[mJ]  =  J2  P*kM  +  PriMi  0<1<N-1,  — <m<(jr-  -)N,  (3.6) 

k- 1 
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(3.7) 


where  PXk  [m,Z]  and  are  defined  for  Xk{t)  and  rj(t): 

1  rp  .  i 

PXk[m,l]  =  - 7=  53  :r*((m  +  9)Tr)e”^a'>  0  <  z  <  N~  1> 

V-'V  9=-AT/2+l  J 

1  N/2  T  2  l 

Pt][n »,/]  =  -=  S  7?((m  +  9)-)e-^,  0</<iV-l,  (3.8) 

ViV  q=—N/2+l  1 

Since  57(f)  is  a  white  noise  process,  for  each  m  the  Fourier  spectrum  J5(|P^[m,  Z]|2)  are  flat  over  the  whole 
frequency  domain  0  <  l  <  N  —  las  mentioned  in  Section  3.1.  This  implies  that,  the  mean  power  of  the 
noise  spectrum  P,][m,l ]  is  also  a2,  which  is  the  same  as  in  the  time  domain. 

We  next  want  to  study  the  mean  power  of  PXk[m,l]  for  the  signal.  Using  (3.4), 


PXk  [m,  /] 


a  (  T  \ 
Ak(m—)e  1 


Therefore, 


"  N' 

N  q=-N/  2+1 


\PXk[m,l}\  «  \Ak(m^-)\^/N6  (l  - 


( m4r)T-2nl 


By  the  assumption  of  distinct  instantaneous  frequencies  for  1  <  k  <  K,  the  Fourier  power 

spectrum  |Prfc[m,/]|2  are  located  at  K  different  frequencies  4>k{mjj)Tl(2it),  1  <  k  <  K.  This  implies 


E  (l  - 


NE  I  I 


Jkirn&T 


(3.10) 


Therefore,  for  each  fixed  time  s  =  rrij,,  in  the  frequency  domain, 


Elpxfc[m,/]  >A^E  Jj) 


(3.11) 


Now,  let  us  come  back  to  the  time  domain  signal  y(m ^).  The  noise  mean  power  is  a2.  The  signal 
power  at  each  time  t  =  is 


5>(m|)  2  <  (E  Ak(m^)  )2<K±  Ak(m§) 

k= 1  \/c=l  /  k—l  1 


(3.12) 


By  comparing  (3.11)  with  (3.12),  it  is  clear  that  the  following  relationship  between  the  SNR^  in  the 
joint  time-frequency  domain  of  (3.6)  and  the  SNRt  in  the  time  domain  of  (3.1)  at  the  sampling  points 


^>05^ 
SNRt  -  5ir 


(3.13) 
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where  0.5  comes  from  the  SNR  definition  in  (2.3)-(3.4).  Therefore,  as  the  window  size  T  is  small  enough 


SNR t1>0 
SNRt  “ 


(3-14) 


Notice  that  the  assumption  of  small  enough  window  size  T  is  equivalent  to  the  assumption  of  fast  enough 
sampling  rate  N/T.  The  derivation  of  (3.14)  implies  the  following  theorem. 


Theorem  1  For  a  multicomponent  signal  with  K  many  monocomponents,  the  SNR  in  the  joint  time- 
frequency  domain  with  the  short-time  Fourier  transform  with  the  rectangular  window  of  size  T,  and 
the  sampling  rate  N/T,  increases  over  the  SNR  in  the  time  domain  on  the  order  ofO(N/K)  when  the 
sampling  rate  is  fast  enough.  Given  the  number  K,  this  increase  rate  0(N/K)  is  optimal. 


Proof:  The  first  part  has  been  proved  by  the  above  argument.  The  optimality  can  be  proved  by  taking 
Ak(t)  =  1  and  0fc(<)  =  ckt2  for  proper  constants  c*  ^  0  for  1  <  k  <  K,  and  noticing  that  the  inequalities 

in  (3.9)-(3.12)  become  equalities  in  this  case.  □ 


4  Numerical  Example 

For  simplicity  in  computations,  we  choose  the  following  two-component  signal  model 

y{t)  =  e^2  +  +  f?(t),  0  <  t  <  2,  (4.1) 

where  r/(<)  is  an  additive  white  Gaussian  noise  with  mean  0  and  variance  <r2  =  9.  The  window  size  for 
the  short-time  Fourier  transform  is  1/8.  The  following  constant  of  the  SNR  increase  rate  in  terms  of 
the  number  of  points  N  of  the  DFT  is  illustrated  in  Fig.  2. 

SNRf/  JN_  (4.2) 

SNRt  /  K' 

One  can  see  that,  for  this  particular  signal, 

Hf-0'55!-  -*->»•  (4-3) 

From  Fig.  2,  one  can  also  see  that  the  constants  of  the  SNR  increase  rate  have  large  variance  when  the 
sampling  rate  is  not  large  enough  but  almost  become  invariant  when  the  sampling  rate  becomes  large. 

5  Conclusion 

In  this  paper,  we  have  quantitatively  analyzed  the  SNR  increase  rate  in  the  joint  time-frequency  domain 
with  the  short-time  Fourier  transform  over  the  SNR  in  the  time  domain  for  multicomponent  signals  in 
additive  white  noise.  We  have  shown  that  the  rate  of  the  SNR  increase  is  on  the  order  of  0(N/K),  where 
K  is  the  number  of  monocomponents  in  a  signal,  N/T  is  the  sampling  rate  and  T  is  the  window  size  in 
the  short-time  Fourier  transform.  Although  we  have  presented  quantitative  analysis  for  the  short-time 
Fourier  transform  with  rectangular  window  functions,  we  believe  that  the  result  also  holds  for  Gaussian 
window  functions. 
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Figure  Captions 


Fig.  1  :  Single  tone  signal. 
Fig.  2  :  SNR  increase  rate. 
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Figure  1:  Single  tone  signal. 
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the  constant  in  the  SNR  increase  rate  0(N/K) 


Figure  2:  SNR  increase  rate. 
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Convergence  of  An  Iterative  Time- Variant  Filtering  Based  on 

Discrete  Gabor  Transform 

Xiang-Gen  Xia*  Shie  Qian* 


Abstract 

An  iterative  time- variant  filtering  based  on  discrete  Gabor  transform  (DGT)  has  been  re¬ 
cently  proposed  by  the  authors.  In  this  paper,  we  present  a  proof  of  the  convergence  of  the 
iterative  algorithm  under  a  sufficient  condition  on  the  analysis  and  synthesis  window  functions 
of  the  DGT.  In  the  meanwhile,  we  show  that  the  iterative  algorithm  refines  the  least  squares 
solution. 


1  Introduction 

Time-frequency  (TF)  transforms  (or  analysis)  add  redundancy  in  the  joint  TF  domain  to  the 
signal  in  the  time  domain.  They  spread  noise  over  the  whole  TF  plane  and  meanwhile  contain  the 
signal  information  in  some  localized  areas  as  shown  in  Fig.  l(a)-(c).  Therefore,  TF  transforms 
usually  significantly  increase  the  signal-to-noise  ratio  in  the  TF  domain,  see  for  example  [19]  for  a 
quantitative  analysis.  In  other  words,  signals  in  the  TF  domain  may  be  easier  to  be  detected  than 
in  the  time  domain  alone.  With  this  observation,  one  might  use  the  following  idea  for  extracting 
the  signal  in  the  time  domain  analogous  to  traditional  linear  filtering:  take  the  TF  transform  of  a 
noisy  signal  s(<);  mask  the  TF  transform  in  the  TF  plane  as  shown  in  Fig.  1(d);  take  the  inverse 
TF  transform  of  the  masked  TF  transform  shown  in  Fig.  1  (d)  as  s{t).  With  traditional  linear 
filtering,  there  is  no  question  about  that  the  Fourier  spectrum  of  the  filtered  signal  s(t)  has  the 
desired  frequency  band.  This  is  because  the  Fourier  transform  is  a  one-to-one  and  onto  mapping 
for  finite  energy'  signals.  Any  signal  in  the  frequency  domain  corresponds  to  a  unique  signal  in  the 
time  domain.  This  is,  however,  no  longer  true  in  general  for  TF  transforms.  Not  every  signal  in 
the  joint  time-frequency  domain  corresponds  to  a  signal  in  the  time  domain  due  to  the  fact  that 
TF  transforms  are  redundant  and  not  onto.  This  implies  that  the  TF  transform  of  the  filtered  s(t ) 
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der  Grant  MIP-9703377,  the  1998  Office  of  Naval  Research  (ONR)  Young  Investigator  Program  (YIP)  under  Grant 
N00014-98-1-0644,  and  the  University  of  Delaware  Research  Foundation. 

fDSP  Group,  National  Instruments,  Austin,  TX  78730-5039.  Email:  qian@natinst.com;  Phone:  (512)794-5504. 
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may  not  fall  in  the  masked  domain  as  shown  in  Fig.  1  (d)-(e).  With  this  observation,  let  us  state 
the  general  time-frequency  synthesis  problem  (also  known  as  the  problem  of  filtering,  time- varying, 
nonstationaiy  wideband  signals).  Given  a  user  specified,  localized  time-frequency  domain  in  the 
TF  plane,  find  the  corresponding  time  domain  waveform.  The  traditional  approach  to  this  problem 
is  the  least  squares  solution  method,  which  finds  the  signal  in  the  time  domain  that  minimizes 
the  squared  error  between  the  signal’s  TF  transform  and  the  desired  one  (see,  for  example,  [1]  for 
ambiguity  functions,  [2-4]  for  TF  transforms).  For  other  approaches,  see,  for  example,  [5].  There 
are  two  drawbacks  to  the  least  squares  solution  method.  The  first  one  is  that  although  the  error 
between  the  TF  transform  of  the  solution  and  the  desired  one  is  minimized  in  the  mean  squared 
error  sense,  the  TF  transform  of  the  solution  is  not  guaranteed  to  have  the  desired  time-frequency 
characteristics.  This  means  the  solution  may  not  be  the  desired  one  as  illustrated  later  by  examples. 
As  a  result,  the  performance  is  limited,  which  will  be  seen  from  our  numerical  results  later.  The 
second  drawback  is  the  computational  complexity  when  signals  are  fairly  long,  which  is  quite  often 
the  case  in  practice.  This  is  because  the  calculation  of  the  pseudo  inverses  of  the  matrices  needed  for 
the  least  squares  solution  method  is  computationally  expensive  when  their  sizes  are  large.  Recently, 
an  iterative  time-variant  filtering  method  based  on  discrete  Gabor  transform  has  been  proposed  by 
the  authors,  see  for  example  [6,  7,  18]. 

In  this  paper,  we  present  a  proof  of  the  convergence  of  the  iterative  algorithm  proposed  in  [6,  7] 
under  a  sufficient  condition  on  the  window  functions  of  discrete  Gabor  transforms.  We  also  prove 
that,  under  these  conditions,  the  first  iteration  of  the  iterative  algorithm  is  exactly  the  least  squares 
solution.  Improvement  over  the  least  squares  solution  occurs  with  more  iterations,  which  can  be 
s<H’u  clearly  from  our  numerical  examples.  This  paper  is  organized  as  follows.  In  Section  2,  we 
first  briefly  review  discrete  Gabor  transforms,  then  restate  the  iterative  algorithm  for  time- varying 
filtering  proposed  in  [6,  7]  and  finally  present  a  proof  of  the  convergence.  In  Section  3,  we  present 
some  numerical  examples. 

2  Convergence  of  the  Iterative  Time- Varying  Filtering 

In  this  section,  we  first  describe  the  iterative  time-varying  filtering  algorithm  proposed  in  [6,  7]  and 
then  study  its  convergence. 
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2.1  Iterative  algorithm 

Let  us  first  review  the  DGT  studied  by  Wexler  and  R az  [8].  Let  a  signal  s[k],  a  synthesis  window 
function  h[n]  and  an  analysis  window  function  7[n]  be  all  periodic  with  the  same  period  L.  Then, 


M-lN-l 


s[k]  =  E  E  Crn,nhm,nH 

(2.1) 

m" 0  n=0 

L- 1 

Cm,n  =  E  s[fc]Tm,n[fc]> 

(2.2) 

k= 0 

hm,nlk]  =  h{k-mAM}W£AN-k , 

(2.3) 

'rm>n[k}  =  'r[k-mAM]W2ANk, 

(2.4) 

and  WL  -  exp(j2n/L),  j  =  -/-l,  AM  and  AN  are  the  time  and  the  frequency  sampling  interval 
lengths,  M  and  N  are  the  number  of  sampling  points  in  the  time  and  the  frequency  domains, 
M  ■  AM  =  N  •  AN  =  L,  MN  >  L  (or  AM  AN  <  L).  The  coefficients  Cm,n  are  called  the  discrete 
Gabor  transform  (DGT)  of  the  signal  s[k]  and  the  representation  (2.1)  is  called  the  inverse  discrete 
Gabor  transform  (IDGT)  of  the  coefficients  Cm,n. 

One  condition  on  the  analysis  and  synthesis  window  functions  7[fc]  and  h[k]  obtained  by  Wexler 
and  Raz  [8]  is  the  following  identity: 

L- 1 

E  h[k  +  mN]W[nMkj* [Jfc]  =  <5[m]<5[n],  0  <  m  <  AN  -  1, 0  <  n  <  AM  -  1.  (2.5) 

*=o 

Tin*  DGT  and  IDGT  can  be  also  represented  in  the  following  matrix  forms.  Let 

C  =  (C0lo, C0,u •••, s  =  (s{ 0], s[  1], •  •  • , s[L  -  1])T. 

Tin*  DGT  can  be  represented  by  the  MN  x  L  matrix  GmnxL  with  its  ( mN  +  n)th  row  and  kth 
column  element  7^n[4  The  IDGT  can  be  represented  by  the  L  x  MN  matrix  Hl*mn  with  its 
kth  row  and  (mN  +  n)th  column  element  /imtn[fc]-  Thus, 

C  =  Gmnxls  and  s  =  HLxmn C.  (2.6) 

The  condition  (2.5)  implies  that 

HlxmnGmnxl  =  hxL,  (2-7) 

where  IlxL  is  the  L  x  L  identity  matrix. 

As  mentioned  in  the  introduction,  the  oversampling,  which  corresponds  to  the  case  when  MN  > 
L,  of  the  DGT  adds  redundancy  and  is  usually  preferred  for  noise  reduction  applications.  This  can 
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be  also  seen  from  [19]  where  it  is  proved  that  the  SNR  in  the  transform  domain  for  short-time 
Fourier  transforms  increases  when  the  sampling  rate  increases.  From  (2.1)-(2.5),  (2.6)-(2.7),  one 
can  see  that' an  L  dimensional  signal  s  is  transformed  into  an  MN  dimensional  signal  C  and  MN 
is  greater  than  L  due  to  the  oversampling.  Therefore,  only  a  small  set  of  MN  dimensional  signals 
in  the  TF  plane  have  their  corresponding  time  waveforms  with  length  L.  Let  DmnxMN  denote 
the  mask  transform,  specifically,  a  diagonal  matrix  with  diagonal  elements  either  0  or  1.  Let  s  be 
a  signal  with  length  L  in  the  time  domain.  The  first  step  in  the  time-varying  filtering  is  to  mask 
the  TF  transform  of  s 

Ci  =  DmnxmnGmnxLS, 

where  DmnxMN  masks  a  desired  domain  in  the  TF  plane.  Since  the  DGT  GmnxL  is  a  redundant 
transformation,  the  IDGT  of  Ci,  HlxMnCi,  may  not  fall  in  the  mask.  In  another  words,  generally, 

GmnxlHlxMnCi  DmnxMnGmnxlHlxmnCu  (2-8) 

where  MN  >  L,  which  is  illustrated  in  Fig.  1(e).  Notice  that,  in  the  critical  sampling  case,  i.e., 
MN  =  L,  the  inequality  (2.8)  becomes  equality.  An  intuitive  method  to  reduce  the  difference 
between  the  right  and  the  left  hand  sides  of  (2.8)  is  to  mask  the  right  hand  side  of  (2.8)  again  and 
repeat  the  procedure,  which  leads  to  our  iterative  algorithm: 


So 

=  s, 

(2.9) 

Ci+i 

=  DmNxMnGmNxL^U 

(2.10) 

S(+l 

=  HlxMnCi+1, 

(2.11) 

l  =  0,1,2,.... 


The  above  iterative  algorithm  is  illustrated  in  Fig.  2. 

The  iterative  algorithm  (2.9)-(2.11)  is  an  alternating  projection  procedure.  It  is  used  quite  often 
in  signal  recovery  applications,  such  as  signal  extrapolation  and  phase  retrieval.  The  important 
issues  for  this  algorithm  are:  When  does  the  algorithm  converge?  If  it  converges,  to  what  does  it 
converge?  We  study  these  questions  in  the  next  subsection. 

Before  going  to  the  convergence,  let  us  see  what  the  least  squares  solution  is.  Based  on  the 
definition,  the  least  squares  solution  is  the  L  x  1  vector  x  that  minimizes 

\\GmnxL*  -  DmnxMnGmnxlsW  =  mm\\GMNxLX-  DmnxMnGmnxlsW,  (2-12) 

where  norm  ||  •  ||  is  the  usual  Euclidean  norm.  Then, 

x  =  {G^hi^iGmnxl)  1G]MNxLDmnxmnGmnxLS,  (2.13) 
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where  t  stands  for  the  complex  conjugate  and  transpose.  Clearly,  when  the  signal  length  L  is  large, 
the  inverse  matrix  computation  is  expensive.  Although  the  error  in  (2.12)  is  minimized,  the  DGT  of 
the  least  squares  solution  x  may  not  fall  in  the  mask  DmnxMN •  GmnxL x  /  DmnxMnGmnxL x, 
when  MN  >  L.  Note  that  when  the  analysis  and  synthesis  window  functions  are  the  same,  i.e., 
HlxMN  =  G^MNxL,  The  least  squares  solution  x  in  (2.13)  reduces  to 

x  =  HlxMnDmnxMnGmnxls , 

which  is  the  first  step  sx  of  the  iterative  algorithm  (2.9)-(2.11). 

2.2  Convergence  of  the  iterative  algorithm 

In  this  subsection,  we  want  to  have  a  condition  on  the  window  functions  /i[n]  and  7[n]  for  the 
convergence  of  (2.9)-(2.11).  We  show  that,  under  this  condition,  the  limit  of  the  sequence  in 
(2.11)  does  have  its  DGT  falling  in  the  mask  DmnxMN- 
Rewrite  (2.9)-(2.11)  as  follows 

C/+1  =  DmNxMnGmNxlHlxMnCi  =  {DmNxMnGmNxlHlxMn)  DmNxMnGmNxLS,  (2-14) 

where  l  =  1,2,....  If  we  can  prove  that  both  matrices  DmnxMN  and  the  product  GmnxlHlxMN 
are  orthogonal  projections,  the  above  iterative  algorithm  converges  by  the  alternating  orthogonal 
projection  theorem  (see  [14]).  A  matrix  A  is  an  orthogonal  projection  [14-15]  means  that  (i)  A2  =  A 
and  (ii)  -4t  =  A,  and  vice  versa. 

It  is  clear  that  the  mask  matrix  DmnxMN  is  an  orthogonal  projection.  For  the  product  matrix 
G.us xiMlxMN  we  have,  by  (2.7), 

{GmnxlHlxMN)2  =  GmnxlHlxMN,  (2-15) 

i.e.,  the  condition  (i)  for  an  orthogonal  projection  is  satisfied.  For  the  Hermitian  property  (ii),  we 
need  the  following  condition  on  the  window  functions  h  and  7  (details  can  be  found  in  Appendix): 

AN-\  AN-i 

Y,  7*[i lN  +  k)h[lN  +  k  +  mAM}=  £  h*[lN  +  k]j[lN  +  k  +  mAM],  (2.16) 
1=0  t=o 

for  k  =  0, 1, ...,N  -  1  and  m  =  0, 1  ,...,M  -  1. 

With  the  above  condition,  the  following  lemmais  not  hard  to  prove. 

Lemma  1  The  product  matrix  GmnxlHlxMN  ts  Hermitian  if  and  only  if  condition  (2.16)  for  the 
window  functions  h  and  7  holds  . 
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Proof:  To  prove  Lemma  1,  we  first  re-express  both  DGT  and  IDGT  matrices  GmnxL  and 
HlxMN  by  taking  the  advantage  of  the  forms  in  (2.1)-(2.4). 

For  1  =  0,1, ...,  M  -  1,  let  Tj  be  the  following  Lx  L  diagonal  matrix 

T,  =  diag(7*[0  -  /AM],7*[1  -  /AM],  •  •  •  ,^\L  -  1  -  /AM]),  (2.17) 

WnxN  be  the  N  points  discrete  Fourier  transform  matrix,  i.e.,  WnxN  =  (lF^mn)o<m,n</v-i-  Let 
WNxL  be  the  following  N  x  L  matrix  consisting  of  AN  many  WnxN  as  submatrices 

Wnxl  —  {WnxN,  WnxN,  •  •  •  >  Whxn)-  (2.18) 


Then,  GmnxL  can  be  rewritten  as 


/  WnxlTo  \ 
WNxlTi 


GmnxL  = 


\  WnxiXm-1  J 

Similarly,  the  matrix  HLxmn  can  be  rewritten  as 


HlxMN  =  (AoWjVx*,)  •  •  • ,  Am-iW^xX,)) 


where  Aj  is  the  following  Lx  L  diagonal  matrix  similar  to  Tf. 

A  i  =  diag(/i[0  —  /AM],  h[l  —  /AM],  •  •  • ,  h[L  —  1  —  /AM]). 


(2.19) 


(2.20) 


(2.21) 


Therefore. 


GmnxiHlxMN  — 


/  wwr0A0H^xz, 


WNxL^O^-M-lWlfxL  \ 


WNxL^M-l^M-lW}fxL  J 


V  WnxL^M-I^-O^NxL 
For  GmnxlHlxMN  to  be  Hermitian  we  need  to  have 

wwrmA„w^xi  =  wNxiTnKiW}rxL. 

With  the  form  (2.18)  for  WNxl,  the  above  (2.23)  can  be  simplified  as  follows. 


(2.22) 


(2.23) 


W ff  x  L  fm A,j  Wjy  x  ^ 
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AN-1 

i  =  Y  WjVxJvdiag^fZiV  +  0  —  mAM]h[lN  +  0  —  nAM],  •  •  • , 

1=0 

7 *[IN  +  N-1-  mAM]h[lN  +  N-1-  nAM])W^xN 

AN-l 

=  WNxN  Y  diag(7*[/iV  +  0  —  mAM]h[lN  +  0  —  nAM],  •  •  • , 

1=0 

7 *[IN  +  N-  1  -  mAM]h[lN  +  N-1-  nAM])^xyv 

Therefore,  the  Hermitian  property  (ii)  for  the  matrix  GmnxlHlxMN  holds  if  and  only  if 
AN—l  AN-1 

Y  1*[IN  +  k  -  mAM]h[lN  +  k  -  nAM]  =  Y  h*[lN  +  k  -  mAM^lN  +  k  -  nAM],  (2.24) 

iZ o  i=o 

for  k  =  0, 1, N  —  1  and  0  <  m,  n  <  M  -  1. 

Since  h[n]  and  7[n]  are  periodic  with  period  L  =  MAM,  the  condition  (2.24)  is  equivalent  to 
AN- 1  AN—l 

Y  7* [IN  +  k  —  mAM]h[lN  +  k  +  nAM ]  =  Y,  h*[lN  +  k  —  mAM]-y[lN  +  k  +  nAM],  (2.25) 

i=o  ■  i=o 

for  k  =  0, 1, ...,  N  -  1  and  0  <  m,n  <  M  -  1.  Notice  that  the  difference  between  (2.24)  and  (2.25) 
is  the  difference  of  the  signs  in  the  front  of  the  variable  n. 

The  condition  (2.16)  can  be  obtained  from  condition  (2.25).  This  proves  Lemma  1.  □ 

With  Lemma  1  and  the  alternating  orthogonal  projection  theorem,  we  have  proved  the  following 
convergence  result. 

Theorem  1  When  the  synthesis  and  the  analysis  window  functions  /i[n]  and  7[n]  satisfy  condition 

(2.16) ,  the  iterative  algorithm  (2.9)-(2.11)  converges. 

There  are  two  trivial  cases  where  the  condition  (2.16)  holds.  The  first  case  is  the  orthogonal- 
like  case:  h[n]  =  7[n]  for  all  integer  n.  The  second  case  is  the  critical  sampling  case:  AM  =  N. 
Notice  that  the  continuous  Gabor  transform  is  never  orthogonal-like  unless  the  window  functions 
are  badly  localized  in  the  frequency  domain.  This,  however,  is  not  the  case  for  the  DGT.  The  most 
orthogonal-like  solution  was  studied  by  Qian  et.  al.  in  [9-11].  They  showed  that  it  is  possible 
to  have  the  analysis  window  function  7  very  close  to  the  synthesis  window  function  h  when  h  is 
truncated  Gaussian.  The  error  between  h  and  7  is  less  than  2  x  10-6  while  they  are  of  unit  energy, 
and  therefore  the  error  is  negligible.  We  will  see  numerical  results  later  in  the  next  section. 

We  next  want  to  see  what  the  limit  of  the  iterative  algorithm  (2.9)-(2.11)  is,  under  the  condition 

(2.16) .  Assume  s  is  the  limit  of  the  sequence  s;  and  C  is  the  limit  of  C/.  Then, 

C  =  DmnxmnGmnxlHlxMnC  —  DmnxmnGmnxls > 
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and 


v 


S  =  HlxMnDmNxMnGmNxLS- 

We  want  to  prove 

Gmnxls  ~  DmnxMnGmnxl*, 

i.e.,  the  DGT  of  s  falls  in  the  mask  DmnxMN ■  Since  GMNxlHLxMN  is  an  orthogonal  projection 
and 


DmnxmnGmnxls  =  GmnxlHlxmnDmnxMnGmnxls  +  (/  —  GmnxlHLxmn)DmnxMnGmnxls 
=  GmnxLS  +  (/  -  GmnxlHlxMn)DmnxMnGmnxls,  (2.26) 

we  have  that 

Gmnxls  -L  -(I  -  GmnxlHLxmn)DmnxmnGmnxls, 
where  J.  means  orthogonal.  Since  DmnxMN  is  also  an  orthogonal  projection  and 

-(/  -  GmNxlHlxMn)HmNxMnGmNxL,S  =  {I  —  DmNxMn)GmNxLS , 

we  have  GmnxL$  =  DmnxMnGmnxls •  This  proves  the  following  Theorem  2. 

Theorem  2  Under  condition  (2.16),  the  DGT  cf  the  limit  s  of  the  iterative  algorithm  (2.9)-(2.11) 
falls  in  the.  mask  DmNxMN t 

Gmnxls  =  DmnxMnGmnxls.  (2.27) 

With  the  above  result,  one  might  ask  whether  it  violates  the  known  fact  that  an  image  of  a  TF 
transform  of  a  signal  in  the  TF  plane  can  not  be  of  compact  support.  This  is  because  that  a  signal 
can  not  be  time  and  band  limited  simultaneously.  To  answer  this  question,  we  first  need  to  know 
that  the  above  known  fact  is  true  for  continuous  TF  transforms.  Moreover,  its  proof  is  based  upon 
the  nnirginal  properties  of  TF  transforms.  It  may  not  be  true  for  discrete  TF  transforms.  In  other 
words,  discrete  TF  transforms  may  have  compact  support  [6]. 

For  the  least  squares  solution  x  in  (2.13),  its  Gabor  transform  GmnxL x  is  the  orthogonal 
projection  of  DmnxMnGmnxLS  onto  the  space  of  all  signals  GmnxL x.  Since  GmnxlHlxMN  is 
an  orthogonal  projection,  by  (2.26),  we  have  proved  that  the  least  squares  solution 

x  =  HlxmnDmnxMnGmnxls  =  Sx- 
This  proves  the  following  Theorem  3. 
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Theorem  3  Under  condition  (2.16),  the  first  iteration  Si  of  the  iterative  algorithm  (2.9)-(2.11)  is 
equal  to  the  least  squares  solution  in  (2.13),  i.e.,  si  =  x. 

With  Theorem  3,  one  will  see  in  the  next  section  that  the  iterative  algorithm  (2.9)-(2.11) 
improves  the  least  squares  solution  when  the  number  of  iterations  increases,  and  in  the  meanwhile 
one  does  not  need  to  compute  the  inverse  matrix  in  (2.13).  Theorem  3  also  provides  another  way 
to  compute  the  least  squares  solution  when  condition  (2.16)  holds  on  the  window  functions.  Note 
that  the  least  squares  solution  in  (2.13)  does  not  depend  on  the  synthesis  window  function  h[n]. 
This  means  that  all  the  least  squares  solutions  are  the  same  for  all  pairs  of  synthesis  and  analysis 
window  functions  as  long  as  the  analysis  window  functions  are  the  same,  such  as  the  Gaussian 
function.  Therefore,  the  improvement  from  the  iterative  algorithm  with  window  functions  satisfying 
Condition  (2.16)  is  over  the  least  squares  solutions  not  only  for  the  window  functions  satisfying 
Conditon  (2.16)  but  also  for  other  window  functions. 

3  Numerical  Examples 

In  this  section,  we  test  two  sets  of  window  functions  of  the  DGT.  The  first  set  of  window  functions 
are  the  most  orthogonal-like  ones  obtained  from  [11,  18-19].  For  this  set  of  window  functions, 
their  difference,  and  the  absolute  values  of  the  differences  between  the  left  and  right  hand  sides  of 
condition  (2.16)  are  shown  in  Fig.  3,  respectively.  The  second  set  of  window  functions  only  satisfies 
the  Wexler-Raz  condtion  (2.5)  and  correspondingly,  they  are  shown  in  Fig.  4.  The  test  signal  is 
,s[n]  =  r[n]  +  77(71],  where  x[n]  =  cos(27r((n  +  1)/115)3)  and  77(71]  is  white  Gaussian  noise.  The  mean 
square  errors  between  the  true  signal  x(n)  and  the  filtered  ones  are  shown  in  Fig.  5  and  Fig.  6  for 
the  two  sets  of  window  functions,  respectively.  One  can  clearly  see  the  performance  difference. 

4  Conclusion 

In  this  paper,  we  presented  a  convergence  proof  of  the  iterative  time-variant  filtering  algorithm 
proposed  in  [6,  7].  We  proved  that,  under  the  condition,  the  limit  of  the  time  waveforms  from  the 
iterative  algorithms  has  the  desired  TF  characteristics.  We  also  proved  that,  under  the  condition, 
the  first  iteration  is  equal  to  the  least  squares  solution. 
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Figure  Captions 


Fig.  1  TF  transform  illustration. 

Fig.  2  Iterative  time-varying  filtering  algorithm. 

Fig.  3  The  first  pair  of  window  functions. 

Fig.  4  The  second  pair  of  window  functions. 

Fig  5  The  tot  set  window  functions:  Solid  line:  SNR  vs.  iteration  steps,  where  the  least  squares 
solution  is  marked  by  *;  Dashed  line:  The  errors  between  masked  and  unmasked  DGT  of  the 

iteration  solutions. 

Fig.  6  The  second  set  window  functions:  Solid  Une:  SNR  vs.  iteration  steps;  Dashed  line:  The  errors 
between  masked  and  unmasked  DGT  of  the  iteration  solutions.  The  least  squares  solution  is 

marked  by  *.  . 
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(a)  TF  of  noise 


(d)  (e) 

Figure  1:  TF  transform  illustration. 


Figure  2:  Iterative  time-varying  filtering  algorithm. 
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Figure  3:  The  first  pair  of  window  functions. 
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Figure  4:  The  second  pair  of  window  functions. 
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Figure  5:  The  first  set  window  functions:  Solid  line:  SNR  vs.  iteration  steps,  where  the  least 
squares  solution  is  marked  by  *;  Dashed  line:  The  errors  between  masked  and  unmasked  DGT  of 
the  iteration  solutions. 
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Abstract 

In  this  paper,  the  determination  of  multiple  frequencies  in  undersampled  waveforms 
is  studied  using  multiple  smaller  size  discrete  Fourier  transforms  (DFT).  Given  the  sizes 
of  multiple  DFT,  a  range  for  the  detectable  frequencies  in  undersampled  waveforms  is 
presented. 

1  Introduction 


One  intuitive  way  to  detect  the  single  frequency  /  in  a  single  frequency  complex-valued 
waveform  x(t)  is  first  to  sample  x(t)  at  a  sampling  frequency  /,  >  /  and  then  to  implement 
the  X  point  discrete  Fourier  transform  (DFT)  with  N  >  fs  and  a  single  peak  in  the  DFT 
domain  can  be  seen.  The  reason  why  the  above  sampling  frequency  fs  does  not  have  to  be 
at  least  twice  of  the  frequency  /  is  because  the  frequency  of  the  waveform  x(<)  is  only  single 
sided.  When  the  frequency  /  is  large,  the  sampling  frequency  is  also  large  in  this  method. 
Several  methods  to  detect  a  single  frequency  in  undersampled  waveforms  have  appeared,  see 
for  example  [1-4].  The  basic  idea  for  these  methods  is  to  use  multiple  DFTs  with  smaller 

sizes  for  undersampled  waveforms  with  different  sampling  rates.  One  of  the  advantages  of 

*  Department  of  Electrical  and  Computer  Engineering,  University  of  Delaware,  Newark,  DE  19716.  Email: 
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under  Grant  N00014-98-1-0644,  and  the  Unversity  of  Delaware  Research  Foundation. 
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using  undersampled  waveforms  is  the  hardware  cost  reduction  in  applications  [5].  In  some 
applications,  such  as  velocity  synthetic  aperture  radar  (VSAR)  [8-9],  the  received  signals 
may  be  of  undersampled  nature. 

In  this  paper,  we  study  the  estimation  of  multiple  frequencies  from  undersampled  complex 
valued  waveforms  by  also  using  multiple  DFTs  for  undersampled  waveforms  with  different 
sampling  rates.  Given  the  sizes  of  these  DFTs  (or  the  sampling  rates)  and  the  number  of 
multiple  frequencies,  we  provide  a  range  of  the  detectable  frequencies.  Note  that  a  different 
approach  was  studied  in  [12]  in  angle  estimation. 

2  Multiple  Frequency  Estimation 

Without  loss  of  generality;  we  assume  that  the  multiple  frequencies  in  a  waveform  x(t)  are 
/,  =  Ni,f2  =  N2, fP  =  Np  and  Nu  N2, ...,  Np  are  all  nonnegative  integers.  For  the  integer 
frequency  assumption,  see  for  example  [1].  The  waveform  x(t )  is  represented  by 

x(t)  =  ±A,e2’^, 

1=1 

where  At.  1  <  l  <  p,  are  p  nonzero  complex- valued  coefficients.  Let  f„  —  m  be  the  sampling 
frequency  with  a  positive  integer  m.  Then  the  sampled  waveform  is 

*»M  =  *(-)  =  E  n  £  Z.  (1) 

171  1=1 

The  problem  of  interest  is  to  detect  the  multiple  frequencies  Nt,  1  <  l  <  p,  from  the  multiple 
undersampled  x(t):  xmr[n],  n  €  Z,  with  mr  <max{lV1,  ...,NP}  for  1  <  r  <  7-  The  main  idea 
in  the  following  study  is  to  implement  the  mr  point  DFT  for  the  undersampled  waveforms 
xmr[n],  0  <  n  <  mr  -  1  for  1  <  r  <  7.  From  these  multiple  DFTs,  the  multiple  frequencies 
can  be  detected.  To  study  the  details  for  a  general  solution,  let  us  first  see  the  single 
frequency  case,  i.e. ,  p  =  1  in  (1).  In  this  case,  let  N  =  N\.  Then 

xmr  [n]  =  Ae2*jNn/m' ,  n  e  Z,  A  ±  0.  (2) 
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Let  N  =  nTmr  +  kr,  0  <  kr  <  mT  -  1,  i.e.,  kT  =  N  mod  mr,  then  the  mT  point  DFT  of 
xmr[n ],  0  <  n  <  mT  -  1,  is 

DFTmr(xmp[n])  =  A5{k  -  kr),  0  <  fc  <  mr  -  1.  (3) 

That  is,  the  residue  kr  =  N  mod  mr  can  be  detected  from  the  mr  point  DFT  of  xmr  [n]  for 
1  <  r  <  7.  Therefore,  to  detect  the  frequency  N  becomes  to  determine  the  integer  N  from 
all  the  residues  kr  for  1  <  r  <  7.  The  following  lemma  tells  us  the  range  of  the  detectable 
N  given  mi,m2, m7,  which  is  called  the  dynamic  range  in  [1]. 

Lemma  1  The  above  single  frequency  N  can  be  uniquely  determined  if 

0  <  N  <  lcm{mi,m2,  (4) 

where  1cm  denotes  least  common  multiple. 

Proof:  Let  m  =lcm{m1,m2,  For  N  >  0,  let  SN  denote  the  1  x  7  integer  vector 

SN  =  (ki(N),  k2{N),  •  •  • ,  kj(N))  with  kr(N)  =  N  mod  mr.  (5) 

To  prove  Lemma  1,  it  is  sufficient  to  prove  that,  for  any  two  different  integers  Nx  and  N2 
with  0  <  A',  ±  N2  <  m,  the  vectors  SNl  ±  SN2.  Assume  this  is  not  true,  i.e.,  there  exist  two 
integers  .V,  and  N2  with  0  <  Ni  ±  N2  <  m  such  that  SNl  =  SNl.  In  other  words,  Ni  -  N2  is 
a  multiple  of  mr  for  each  1  <  r  <  7.  It  implies  Nx  -  N2  =  nm  for  a  nonzero  integer  n,  which 
is  impossible  when  0  <  NX,N2  <  m.  This  contradicts  with  the  assumption,  i.e.,  Lemma  1  is 
proved.  □ 

Lemma  1  is  basically  the  Chinese  Remainder  Theorem  (CRT),  see  for  example  [7].  It  is 
clear  that  the  single  frequency  N  can  be  found  from  the  detected  residues  &r,  1  <  r  <  7,  by 
using  the  CRT.  The  proof  suggests  another  method  to  detect  N  by  simply  looking  up  the 
table  of  the  vectors  SN  defined  in  (5),  which  can  be  done  in  priori.  As  a  remark,  the  reason 
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for  maintaining  the  above  proof  is  to  motivate  the  following  general  solution  for  multiple 
frequency  estimation. 

We  now  study  the  multiple  frequency  estimation  problem,  where  p  frequencies  appear  in 
a  waveform  x (t)  with  its  undersampled  versions  shown  in  (1)  with  m  =  mi,m2,  Let 

kiyr  =  Ni  mod  mr,  1  <  l  <  p,  1  <  r  <  7.  (6) 


Then  the  mT  point  DFT  of  xmr  [n],  0  <  n  <  mr  —  1,  is 

DFTmr  {xmr  M)  =  £  At6{k  -  kltr),  0  <  k  <  mr  -  1,  1  <  r  <  7-  (7) 

1= 1 

This  tells  us  that  the  residue  frequencies  kl>r  can  be  seen  as  peaks  in  the  DFT  domain,  i.e., 
they  can  be  detected  from  the  mT  point  DFT  of  xmr[n\.  Thus,  the  determination  of  the 
original  p  frequencies  N\,  •••,  Np  becomes  the  determination  of  the  nonnegative  integers 

N\,N2, ...,  Np  from  their  residues  kiiT  =  Nr  mod  mr  for  1  <  l  <  P  and  1  <  r  <  7.  The 
following  result  gives  a  range  of  Nu  N2,  ...,  Np  for  the  uniqueness  of  the  determination. 

Theorem  1  Assume  that  a  complex  valued  waveform  x (t)  contains  p  different  frequencies 
-  yt  >  0  for  1  <  /  <  p.  Let  mr,  1  <  r  <  7,  be  7  sampling  rates  in  the  undersampled 
versions  jmr[n]  of  x(t)  in  (1)  with  m  =  mT,  1  <  r  <  7.  Let 

7  =  r)p  +  6,  0  <  6  <  p,  (8) 


where  1 )  is  a  nonnegative  integer.  Then  the  p  frequencies  fi  —  Ni  >  0  for  \  <  l  <  p  can  be 
uniquely  determined  by  using  the  mr  point  DFT  of  imr[n]  for  1  <  r  <  7  i/ 

max{Ni,  N2,  •  •  • ,  Np}  <  max{m,  mum2,  •  •  • ,  m7},  (9) 


where 


m 


a  /  mini<ri<r2<...<r,<7  lcm{mri,mr2,  •  •  • ,  mrJ,  if  p  >  0 


*1 


0, 


otherwise, 


(10) 


where  77  is  defined  in  (8). 
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Proof:  If  m  <  maxjmi,  m2,  •  •  • ,  m7},  Theorem  1  is  straightforward  by  simply  using 
the  single  DFT  for  the  maximum  mr,  1  <  r  <  7.  Therefore,  in  what  follows  we  assume 
m  >  max{m1,m2l  •  •  -  ,m7}. 

For  an  integer  N,  let  kltt.(N)  =  N  mod  mr.  For  nonnegative  integers  N\,  N2, ...,  Np,  let 
Sr(Nu  •  •  • ,  Np)  be  the  following  set 

Sr(Nu  ■  ■  ■ ,  Np)  =  {kX'riNJ,  •  •  • ,  kp,r(Np)}.  (11) 

Let  S(Ni,  •  •  • ,  Np)  be  the  following  product  set 

S(Nu---,Np)  =  S1(Nir--,Np)x---xSy(N1,-..,Np).  (12) 

To  prove  Theorem  1,  it  is  sufficient  to  prove  the  following  uniqueness:  if  there  are  two  sets  of 
p  different  nonnegative  integer  frequencies  {NUN2,...,NP}  and  {Mlt  M2, ...,  Mp}  such  that 
5(Ari,  •  •  • ,  Np)  =  S(M\,  •  •  • ,  Mp),  max{Nu  N2,  •  •  • ,  Np}  <  m,  and  max{Mi,  M2,  ■  •  • ,  Mp}  < 
m.  then  the  two  frequency  sets  are  equal,  i.e.,  {NUN2, ...,  Np}  =  {MUM2, ...,  Mp }.  We  first 
prove  that  {Nu  N2, Np  }  C  {Mu  M2, ...,  Mp). 

By  the  assumption  m  >  max{m1,m2, m7}  in  the  beginning  of  the  proof,  we  know  that 
7/  >  1 .  By  the  condition  S(NU  •  •  • ,  iVT)  =  S(MU  •  •  • ,  Mr)  we  obtain  that  for  Nx  and  each  mr 
there  exists  at  least  one  integer  denoted  by  yr  with  1  <  yr  <  p  =  r  such  that  Nx  -  MVr  =  0 
mod  rnr  for  1  <  r  <  7.  By  (8),  on  the  other  hand,  7  is  at  least  77  times  larger  than  r  =  p. 
This  means  that  there  are  at  least  77  many  mn , mr,  with  1  <  rx  <  r2  <  •  •  •  <  rv  <  7  such 
that 

M/r,  —  =  •  •  •  =  Myrti  =  M/0, 

where  1  <  l0  <  t.  Thus,  jVx  -  Mto  =0  mod  mrc  for  e  =  1, 2, ...,  77.  By  conditions 
max{./Vi,  N2, Nt}  <  m  <lcm{mri, ...,  mri)}  and  max{Mi,  M2 , ...,  MT}  <  m  <lcm{mn, ...,  mri)} 
from  the  definition  of  m  in  (10),  we  conclude  Nx  =  M/o  similar  to  the  proof  of  Lemma  1.  This 
proves  that  Nx  €  {Mu  M2, ...,  Mp },  and  therefore  {Arx,  N2, ...,  iVp}  c  {Mu  M2, ...,  Mp}  can  be 
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{Mi,  M2, Mp}  C  {Nu  N2,  Np}. 


similarly  proved.  By  the  same  argument,  we  can  prove 

This  proves  Theorem  1.  □ 

Prom  the  proof,  similar  to  what  was  mentioned  after  the  proof  of  Lemma  1,  the  p 

frequencies  Nu  ....  K  can  be  detected  by  looking  up  the  table  of  the  product  sets 
S(JV„  fV2, ....  N„)  defined  in  (11)-(12).  The  uniqueness  in  Theorem  1  guarantees  the  correct¬ 
ness  of  the  solution  when  the  condition  (9)  is  satisfied,  i.e.,  when  these  frequencies  are  in  the 
range  defined  by  (9).  Other  determination  methods  similar  to  the  CRT  for  single  frequency 
estimation  might  exist  and  are  definitely  interesting. 

3  Example 

in  this  section,  we  see  one'simple  example.  Consider  the  case  of  two  frequencies  JV,  and  AT,. 
We  choose  m,  =  17,  m2  =  19,  m,  =  20,  and  m4  =  21.  In  this  case,  p  =  2,  7  =  4,  and 
therefore  r,  =  2  in  (8).  Clearly,  m  =  mtm2  =  323.  By  Theorem  1,  all  two  different  frequencies 
.V,  and  ;V2  in  the  range  [0, 322]  can  be  uniquely  determined  from  the  undersampled  waveforms 
with  sampling  rates  17,  19,  20  and  21  by  using  17,  19,  20  and  21  point  DFTs,  respectively. 
We  ran  see  that  the  sampling  rates  are  more  than  15  times  less  than  the  Nyquist  sampling 

rate  when  A'i  and  iV2  are  close  to  322. 

4  Conclusion 

1,.  this  paper,  we  studied  the  estimation  of  multiple  frequencies  in  undersampled  complex 
valued  waveforms  using  multiple  DFTs.  Given  the  sizes  of  these  multiple  DFTs  or  the 
undersampling  rates,  we  provided  a  range  for  the  detectable  frequencies.  Our  example  shows 
that  a  significant  sampling  rate  reduction  over  the  Nyquist  sampling  rate  can  be  achieved. 
It  should  be  noticed  that  the  range  determined  in  Theorem  1  might  not  be  the  maximal 

one.  The  search  of  the  maximal  range  given  p,  m, . m,  is  under  our  current  investigation. 

After  this  paper  was  written,  some  results  on  the  maximal  range  were  obtained  in  [10] 


with  a  sufficient  condition  on  the  multiple  frequencies.  The  approach  in  this  paper  might 
be  generalized  to  multidimensional  frequency  estimation  by  using  multidimensional  Chinese 
Remainder  Theorem  in  [6].  In  [11],  the  results  have  been  recently  applied  in  enlarging  the 
dynamic  range  of  the  detectable  parameters  for  polynomial  phase  signals  using  multiple  lag 
diversities  in  high-order  ambiguity  functions. 
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Abstract 

In  this  paper,  we  investigate  the  feasibility  of  blind 
,,1/ial  recovery  from  undersampled  data  collected  from 
plurality  receivers.  We  show  that  although  an  un- 
sampled  communication  system  is  not  completely 
i,l,  n  ti fable  m  general,  such  an  obstacle  can  be  over - 
,„me  by  employing  proper  precoding  with  an  arbitrary 
amount  of  the  bandwidth  expansion  in  the  transmitter. 

1 1^  mam  contribution  of  this  study  is  the  formulation 
,./  n  generic  framework  for  the  undersampled  systems, 
and  the  derivation  of  conditions  for  a’ class  of  filters 
uluch  we  term  ambiguity  resistant  precoders. 

1  Introduction 

Because  of  its  practical  significance,  blind  identi- 
lirntion  of  FIR  channel  has  received  considerable  at- 
i.-ntion  in  the  past  decade  in  communications  and  sig¬ 
nal  processing  [1].  To  date,  almost  all  research  on 
|,|ind  identification  deals  with  channel  outputs  that 

sampled  at  least  at  the  baud  rate.  In  certain  appli¬ 
cations.  a  communication  system  may  be  unrfcrsam- 
,,hd  with  rate  1  /LT(L>  1),  for  reasons  ranging  from 
fixed  hardware  to  variable  data  rates  of  source  signals. 

(  Varly,  perfect  signal  recovery  is  not  possible  in  these 
-.  .-narios  However,  when  a  collection  of  low  rate  ob- 
-  rvations  is  available,  it  may  be  feasible  to  restore  the 
-<>urce  signals  by  combining  partial  information  from 
different  receivers 

In  this  paper,  we  study  the  application  of  multiple 
receivers  in  blind  source  recovery  for  undersampled 
communication  systems.  To  put  this  into  perspective, 

•  onsider  an  Af -receiver  undersampled  system  depicted 
in  Figure  1,  where  L  is  an  integer.  Since  the  receiver 
part  is  mathematically  equivalent  to  a  multiple  input 
and  multiple  output  (MIMO)  system,  one  can  at  most 
blindly  recover  the  input  towards  a  matrix  ambiguity. 

•This  work  was  sponsored  in  part  by  the  Air  Force  Office 
••f  Scientific  Research  (AFOSR)  under  Grants  No.  F49620-97- 
1-0318  and  No.  F 49620-97- 1-0253,  and  the  National  Science 
Foundation  CAREER  Program  under  Grants  MIP-9703074  and 
MIP-9703377. 
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The  question,  then,  becomes:  is  there  any  affordable 
way  to  restore  the  blind  identifiability? 

Filterbank  precoding  has  been  proposed  to  com¬ 
bat  ISI  channels  in  wireless  communications  [3].  The 
same  concept  has  been  applied  by  Giannakis  for  blind 
channel  identification.  In  [4],  it  is  shown  that  an 
FIR  channel  can  be  blindly  determined  with  mini¬ 
mum  redundancy  introduced  by  precoding.  Motivated 
by  these  studies,  we  propose  to  use  precoding  tech¬ 
niques  to  solve  the  blind  identification  problem  for  the 
undersampled  system.  More  specifically,  we  study  a 
class  of  ambiguity  resistant  precoders  which  is  capa¬ 
ble  of  removing  the  ambiguity  introduced  by  under- 
sampling.  In  the  remainder  of  this  paper,  we  shall 
denote  the  system  in  Figure  1  with  rate  K/N  pre¬ 
coder  as  [( K,N)\{L,M )].  A  regular  communication 
system  with  transmission  induced  redundancy  can  be 
cast  into  the  same  framework. 

2  A  Generic  Framework 

Throughout  our  discussion,  the  follow  assumptions 
are  invoked  for  the  [(K,  N).(L,  M)] system  under  con- 
sideration: 

A.l:  The  precoding  filter  has  dimension  S  x  A',  where 

N  >  K; 

A. 2:  N/K  x  M/L  >  1,  i.e.,  NM  >  KL. 

A.l  is  clearly  required  in  the  precoding,  otherwise 
there  will  be  no  increase  in  redundancy  which  ren¬ 
ders  blind  identification  impossible.  The  same  is  true 
A. 2  since  MN/KL  quantifies  the  overall  system  re¬ 
dundancy.  Under  A.l  and  A.2,  there  are  still  many 
possible  combinations  of  the  four  parameters,  K,  N, 
L,  and  M,  which  make  a  unified  analysis  difficult.  The 
following  lemma  simplifies  the  our  data  model  by  cast¬ 
ing  any  system  that  satisfies  A.l  and  A.2  into  a  generic 
framework. 

Lemma  1  Any  [(X,  N_)\ (L,  M)]  system  with  N  > 
X  and  N_  M.  >  K.  L  can  be  cast  into  .a  generic 
[{K,N)\{N,M)]  system  with  K  <  N  <  M. 
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Precoding  Multi-receiver  Undersampling 


Figure  1:  Precoding  for  undersampled  an  antenna  array  system 


When  L  =  A£  =  1,  the  system  becomes  a  symbol- 
rate  communication  system  with  input  redundancy. 
The  reader  is  referred  to  [5]  for  proofs  of  the  above 
lemma  and  theorems  in  the  ensuing  sections. 

3  Blind  Identification 

The  output  of  the  generic  system  in  Figure  2  can 
be  expressed  as 

yw.iU)  =  H„,,(r)u  „xi(r)  =  H(z)G*xkU)skxi(--), 

where  H(r)  characterizes  the  unknown  channel, 
whereas  G(:)  represents  the  known  precoder.  The 
problem  herein  is  to  determine  the  input ,  s(z),  and  in 
many  eases  the  channel,  H(;),  from  the  output,  y(*)* 
using  only  knowledge  of  the  precoder  filter ,  G(r). 

To  facilitate  the  forthcoming  discussion,  let  us  first 
lay  some  groundwork  by  reviewing  an  important  result 
regarding  FIR  MIMO  systems. 

Theorem  1  [6]  For  an  N -input  M-outpui  (M  >  N) 

FIR  system  with  transfer  function  H(r),  the  following 
statements  are  equivalent: 


Figure  2:  A  generic  representation 


1.  H(z)  is  irreducible,  i.t.,  rank[H(z)j  =  N,  Vz  €  C 
and  rank[H«]  =  N; 

2.  H(z)  and  the  input  vector  u(z)  can  be  identified 
up  to  an  N  x  N  invertible  constant  ambiguity  ma- 
trtx  from  the  outputs  using  second  order  statistics. 

If  the  precoder  is  designed  to  be  irreducible,  the 

composite  transfer  function,  Hc(z)  =  H(z)G(z),  is 
clearly  irreducible.  Theorem  1  asserts  that  the  system 
input  s(z)  can  only  be  determined  within  a  A  x  K  ma¬ 
trix  ambiguity  directly  from  y(z).  However  the  prob¬ 
lem  of  interest  here  is  to  find  s(z)  and  Hc(z)  such  that 

y(z)  =  H«(z)s(z),  subject  to  Hc(z)  =  H(z)G(z), 

where  G(z)  is  a  known  precoder.  This  motivates  the 
following  blind  identifiability  concept. 

Definition:  The  system  in  Figure  2  is  blindly  iden¬ 
tifiable  if  s(z)  =  as(z)  and  H(z)  =  0H(z),  where  a 
and  0  are  two  scalars,  are  the  only  solution  for  the  fol¬ 
lowing  system  given  the  output  y(z)  and  the  precoder 
G (z):  y(z)  =  H(z)G(z)s(z). 

We  tackle  the  blind  identification  problem  in  two 
steps:  (i)  determine  what  we  term  the  ambiguous  in¬ 
puts, 

u(z)  :  Tu(z)  =  u(z),  (1) 

where  T  is  an  N  x  N  fully  rank  constant  matrix, 
blindly  from  the  system  output  y(z).  It  can  be  accom¬ 
plished  using  many  existing  approaches  when  H(z)  is 
irreducible;  (ii)  Once  u(z)  is  identified,  the  blind  iden¬ 
tification  problem  reduces  to  whether  or  not  s(z)  can 
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jw>  determined  from  u(z)  in  the  presence  of  a  full  rank 
imbiguity  matrix  T  (or  T-1).  We  then  show  that 
exists  a  class  of  ambiguity  resistant  precoders 
w  Inch  can  resolve  the  matrix  ambiguity  without  addi¬ 
tional  information.  Since  Step  (i)  is  well  studied  (6,  7], 
Mir  focus  in  the  remainder  of  this  paper  will  be  de¬ 
voted  to  the  precoder  part. 

31  Ambiguity  Resistant  (AR)  Precoders 

We  first  define  the  concept  of  ambiguity  resistance. 

Definition:  An  N  x  K  FIR  irreducible  precoding  fil¬ 
ler  G(z)  i5  ambiguity  resistant  if  its  input  s(z)  can  be 
uniquely  determined  up  to  a  scalar  from  its  ambiguous 
output ,  {u(z)  :  Tu(z)  =  u(z)},  where  T  is  an  un¬ 
known  tnvertible  N  x  N  constant  matrix. 

A  precoder  is  not  ambiguity  resistant  if  there  exists 
,i  non-identity  matrix  R  ^  aT  and  s(z)  ^  0s(z)  such 
i hat  Ru(r)  =  G(z)s(z),  or  equivalently, 

RT-1G(r)s(z)  =  G(r)5(z), 
for  any  given  s(z). 

Denote  E  =  RT“l  and  rewrite  the  above  equation 
using  matrix  input  and  matrix  output,  the  precoder  is 
ambiguity  resistant  unless  there  exists  an  N  x  N  full 
rank,  nonidentical,  constant  matrix  E  and  a  A'  x  A' 
nomdentical  matrix  X(z)  such  that 

EG(z)  =  G(z)X(z)  (2) 

X(z)  is  the  polynomial  ambiguity  of  the  input  which 
cannot  be  determined.  Note  that  since  G(z)  is  irre¬ 
ducible,  det(X{z))  in  (2)  is  a  nonzero  constant,  i.e., 
X(:)  is  unimodular. 

The  above  is  summarized  in  the  following  theorem. 

Theorem  2  An  .V  x  A'  FIR  irreducible  precoding  fil¬ 
ter  G(:)  is  ambiguity  resistant  if  and  only  if  there  does 
not  exist  an  N  x  S  full  rank  constant  matrix  E  ^  al 
for  any  constant  a,  and  a  K  x  K  matrix  X(z)  gk  01 
for  any  consfanf  0t  such  that  the  above  identity  (S) 
holds. 

To  examine  the  ambiguity  resistancy  of  a  given  pre¬ 
coder,  note  that  it  follows  from  Equation  (2)  that 
X(c)  =  G“1(z)EG(z).  Hence, 

EG(z)  =  G(:)G-l(:)EG(r).  (3) 

By  representing  the  above  equation  in  the  time  do¬ 
main,  one  may  check  the  ambiguity  resistancy  of  G(z) 
by  solving  a  linear  equation  set.  If  E  =  al  for 
some  constant  o  is  the  only  nonzero  solution,  then 


G(r)  is  ambiguity  resistant.  Otherwise,  it  is  nec¬ 
essary  to  check  whether  X(r)  =  01  for  some  con¬ 
stant  3  or  EG(:)  =  G(z).  since  it  is  possible  to  have 
EG(;)  =  G (:)  with  E  #  al. 

When  A  =  I,  X(z)  =  a  for  some  nonzero  constant 
a  is  always  true.  By  Theorem  2,  the  following  corol¬ 
lary  is  straightforward. 

Corollary  1  An  N  x  1  FIR  invertible  precoding  filter 
G(z)  is  always  ambiguity  resistant  for  N  >  1. 

Corollary  2  Any  N  x  K  with  K  >  1  block  precoder 
G(z),  i.e.,  G(z)  is  a  constant  matrix,  is  not  ambiguity 
resistant. 

Corollary  1  is  not  surprising  since  when  K  =  1, 
the  [(1,  N);(N,  M)]  system  reduces  to  a  conventional 
oversampled  system  which  is  clearly  identifiable.  With 
this  result,  we  only  need  to  consider  the  case  of  K  >  1. 

Next,  we  want  to  present  some  necessary  conditions 
on  the  ambiguity  resistance. 

Theorem  3  //  an  N  x  K%  K  >  1,  FIR  irreducible 
precoder  G(z)  is  ambiguity  resistant,  then 

1.  there  exist  no  full  rank  constant  matrix  E  and 
invertible  K  x  K  polynomial  matrix  V(z)  such 
that  the  first  column  in  matrix  EG(z)V(z)  is 
( 1,0,0,  ...,0)T; 

2.  N  >  K. 

3.  the  order  Q  of  G(z)  must  satisfy  the  following 
lower  bound 


Q> 


N7  +  A'2  -  1 
NK 


The  above  Theorem  will  allow  us  to  construct  a 
family  of  AR  precoders  in  Section  3.2. 


Lemma  2  [8]  When  G*x*(z)  is  irreducible,  its 

Smith-McMillan  form  ts  given  by 


G  = 


I#c  XK 

0 


V(*), 


where  WNXN(z)  and  V*XJC(z)  arc  referred  to  as  the 
left  and  right  unimodular  matrices,  respectively,  in  the 
Smith-McMillan  decomposition  of  G(z). 


The  left  unimodular  matrix,  W(z),  can  be  further 
decomposed  into 

W(M  =  [wj,XJt(z)  Wtx(M_K)(z)]  . 
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Clearly,  WJ(z),  associated  with  the  identity  part  of 
the  middle  Smith  form,  essentially  defines  the  column 
span  of  G(r).  The  Smith-McMillan  decomposition  of 
a  tall  invertible  matrix  can  be  simplied  as  G(:)  = 
W,(c)V(r),  where  W*(r)  is  invertible. 

Theorem  4  A  precoding  filter  G(r)  is  ambiguity  re¬ 
sistant  if  and  only  if  there  exists  no  N  x  N  con¬ 
stant  matrix  E  such  that  both  W (z)  and  EW (z), 
W *(z)  ^  aEW*(z),  are  left  unxmodular  matrices  in 
the  Smith-McMillan  decompositions  of  G(z). 

3.2  A  Family  of  AR  Precoders 

Motivated  from  the  necessary  conditions  in  Theo¬ 
rem  3,  we  now  want  to  construct  a  family  of  ambigu¬ 
ity  resistant  precoders  G(z).  We  have  the  following 
result. 

Theorem  5  For  any  positive  integer  N  >  1,  the  fol¬ 
lowing  matrix  G(z)  with  size  N  x  ( N  —  1)  is  ambiguity 
resistant: 


4  Algebraic  Sequence  Identification 
Algorithm 

We  derive  in  this  section  an  algebraic  algorithm 
which  can  accomplish  blind  identification  with  a  finite 
number  of  observations.  For  this  purpose  we  only  con* 
sider  noise-free  data  without  claiming  anything  con¬ 
cerning  the  optimality  of  the  algorithm. 

Since  the  ambiguous  precoder  output,  {u[n]J,  can 
be  identified  using  one  of  the  existing  multichannel 
blind  identification  algorithms,  e.g.,  [9,  7].  we  limit 
ourselves  to  the  problem  of  removing  the  matrix  am¬ 
biguity  from  {u[n]}. 

Given  a  finite  collection  of  the  ambiguous  precoder 
outputs,  {u[nl};:£,  it  is  not  difficult  to  establish  the 
following  relations  from  (1), 

'  «I0]  j  f  s[-g] 

diag(T  -  T)  :  :  ,  (5) 

.  u[fl  -  1]  J  s[fl  -  1] 


1  0  0 

r-'r  i  0 

0  z~y  1 

Gu)=  .  . 

0  0  0 

.0  0  0 

for  an  integer  7^0. 


0 

0 

0 

z-"> 

0 


0 

0 

0 

1 

z-> 


The  above  theorem  can  be  easily  modified  for  con- 
tructing  simple  AR  precoders  of  any  size. 

3.3  System  Identifiability 

W'ith  the  establishment  of  ambiguity  resistant  pre¬ 
coders,  we  now  give  a  set  of  sufficient  conditions  for 
blind  identifiability  of  the  system  in  Figure  2. 


Theorem  6  The  system  depicted  in  Figure  S  is 
blindly  identifiable  when 


where  Q„  is  a  block  Toeplitz  matrix  of  G[n]. 

Upon  denoting  tj  the  ith  column  of  T,  t  = 
[tr  -t-r.  and  U  =  [nT[0)  -  uT[R-l))T 
we  may  rearrange  Equation  (5)  with  respect  to  its  un¬ 
knowns,  namely,  s  and  t,  and  obtain 

(-«.  U|[|]=0  (6) 

Since  we  have  NR  equations  with  (/J  -f  Q)K  +  N 2 3 
unknowns,  the  above  equation  set  becomes  overdeter¬ 
mined  as  R  increases,  provided  that  N  >  K.  The 
system  can  be  identified  using  simple  least  squares  fit¬ 
ting  when  G(z)  is  ambiguity  resistant. 

The  above  identification  procedure  can  be  summa¬ 
rized  as  follows, 

1.  Determine  the  precoder  output  vectors  within 
an  N  x  N  matrix  using  any  existing  MIMO 
blind  identification  method  (e.g.y  the  subspace 
approach  in  [7,  6]). 


1.  G(z)  is  ambiguity  resistant; 

f.  H(z)  is  irreducible; 

3.  G(r)  has  orderQ  >  \Zi*z+9±l-Qh-.iQi  whtre 
R«  =  and  Qit  «  order  ofH(z). 


2.  Form  a  linear  equation  set  using  the  ambiguous 
precoder  output  vectors,  {u[n]}^^,  as  in  (6). 

3.  Determine  elements  of  the  ambiguity  matrix  from 
the  the  least  significant  singular  vector  of  (??). 

4.  Recover  the  message  signals  as  s(z)  = 
G~l(x)T~,i»(z). 
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5  Numerical  Examples 


Some  numerical  results  are  presented  in  this  sec- 
1,011  to  validate  the  identifiability  and  the  efficacy  of 
proposed  algorithms.  All  examples  involved  an 
o. antenna  system  with  the  unsampling  rate  3.  The 
follow  ambiguity  resistant  precoder  described  in  Sec- 

tion  3  2  was  used:  G(z)  = 


.-2 


1 

,-2 


The  sys- 


,em  simulated  is  [(2, 3);  (3,8)];  and  the  order  of  the 


channel  is  2. 

The  closed-form  input  estimation  approach  de¬ 
scribed  in  [7]  was  used  to  determine  the  ambiguous 
precoder  output,  u[n].  Only  30  estimated  vectors  were 
applied  to  the  proposed  method.  Figure  3  compares 
the  signal  constellations  of  the  antenna  outputs,  the 
recovered  precoder  outputs,  and  the  recovered  signals. 
As  shown  in  Figure  3  (c),  existing  approaches  can  only 
restore  the  transmitted  signals,  i.e.,  the  precoder  out¬ 
puts,  within  an  matrix  ambiguity.  However,  with  pre¬ 
coding  and  the  algorithm  presented  in  this  paper,  the 
symbol  sequence  can  be  blindly  recovery  without  sig¬ 
nificant  increase  in  bandwidth. 

Figure  4  shows  how  the  mean-square  error  (MSE) 
of  the  symbol  estimates  varies  with  the  SNR. 
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Figure  3:  Signal  Constellations  before  and  after  Blind 
Recovery 


6  Concluding  Remarks 

In  this  paper,  we  have  shown  that  by  introducing 
redundancy,  albeit  minimum,  at  the  input  through 
precoding  techniques,  blind  identification  can  be  ac¬ 
complished  for  undersampled  systems  in  most  scenar¬ 
ios.  An  important  concept  on  precoders,  i.e.,  ambigu¬ 
ity  resistant  precoders,  has  been  introduced  and  used 


in  the  blind  identification.  Some  conditions  for  ambi¬ 
guity  resistant  precoders  have  been  given  and  a  family 
of  such  precoders  has  been  presented.  Also  presented 
is  an  algebraic  algorithm  which  determines  the  un¬ 
knowns  of  an  undersampled  system  with  a  finite  num¬ 
ber  of  data  samples. 
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Abstract 

Ambiguity  mutant  (AR)  preceding  has  been  recently 
proposed  in  inter  symbol  interference  (ISI)  and  multipath 
cancellations,  where  the  ISI/multipath  channel  may  have 
frequency-selective  fading  characteristics  and  its  knowledge 
is  not  necessarily  known.  With  the  AR  precoding,  nodi i/er_ 
j»ly  is  necessary  at  the  receiver.  In  the  precoding,  the  AR 
property  for  a  precoder  plays  on  important  rule.  In  this  re¬ 
search  we  introduce  the  concepts  of  precoder  distance  and 
optimal  precoders,  and  characterize  and  construct  all  opti¬ 
mal  systematic  AR  precoders,  when  additive  channel  ran¬ 
dom  noue  is  concerned.  A  necessary  and  sufficient  con¬ 
dition  for  an  A  R  precoder  to  be  optimal  is  given,  which  is 
easy  to  check.  With  the  optimal  precoders,  numerical  sim¬ 
ulations  are  presented  to  show  the  improved  performance 
over  the  known  AR  precoders  in  ISI  cancellation  apphea- 
Uons. 

1  Introduction 

Intersvmbol  interference  (ISI)  and  multipath  fading  are 
important  problems  in  digital  communications.  Precod¬ 
ing  is  one  of  the  techniques  for  the  ISI/multipath  cancel¬ 
lation  The  conventional  precoding  techniques,  such  as 
Tomlmson-Harashima  (TH)  precoding  and  trellis  precod¬ 
ing,  and  other  ISI  cancellation  techniques,  such  as  deci¬ 
sion  feedback  equalizers,  usually  suffer  from  the  spectrum- 
null  characteristics  in  frequency-selective  fading  channels. 
Meanwhile,  the  conventional  precoding  methods  require 
the  knowledge  of  the  ISI  channel  at  the  transmitter,  i.e., 
a  feedback  channel  is  needed.  Recently,  a  new  precoding 
technique  has  been  introduced  in  [1-6].  Unlike  the  con¬ 
ventional  precoding  the  new  precoding  expands  the  band¬ 
width  in  a  minimum  amount  as  an  expense.  The  advan¬ 
tages  of  the  new  precoding  are  the  following:  when  there  is 
no  other  noise  but  the  ISI,  it  provides  an  ideal  linear  FIR 
equalizer  at  the  receiver  no  matter  whether  or  not  the  ISI 
channel  has  spectrum-null;  it  is  channel  independent,  i.e., 
the  feedback  channel  is  not  necessary;  it  is  linear  (no  mod- 


*This  research  was  supported  in  part  by  an  initiative  grant 
from  the  Department  of  Electrical  Engineering,  Umverotyof 
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under  Grant  No.  F49620-97-1-0253,  and  the  National  Science 
Foundation  CAREER  Program  under  Grant  MIP-9703377. 


ulo  operation  is  needed);  the  transmitter  or  receiver  does 
not  have  to  know  the  ISI  channel  for  the  equalization,  i.e., 
blind  equalization  is  possible. 

For  the  blind  equalization  with  the  new  precoding  tech¬ 
nique,  no  diversity  at  the  receiver  is  needed  for  a  single  re¬ 
ceiver  system,  and  a  reduced  sampling  rate  over  the  baud 
rate  can  be  achieved  in  an  antenna  array  receiver  system, 
which  are  not  possible  for  the  existing  blind  equalization 
techniques,  see  for  example  (7-8],  without  using  precod¬ 
ing.  For  this  purpose  ambiguity  resistant  (AR)  precoders 
have  been  introduced  in  [2-3]  for  combating  the  ambigu¬ 
ity  induced  by  the  ISI  channel.  Besides  the  AR  precoder 
concept,  some  properties  and  families  of  AR  precoders  are 
presented  in  [2-5]. 

In  this  research,  the  concept  of  the  optimal  precoders 
is  introduced,  when  additive  channel  random  noise  is  con¬ 
cerned.  The  optimality  is  based  on  the  following  crite¬ 
rion:  the  output  symbols  after  the  precoding  should  be  as 
far  away  from  each  other  as  possible  in  the  mean  square 
sense.  This  criterion  is  similar  to  the  one  in  the  modu¬ 
lation  symbol  design  in  communication  systems  to  resist 
random  errors.  Given  a  precoder  G(z),  a  polynomial  ma¬ 
trix  of  the  delay  variable  z_l ,  its  distance  is  introduced  by 
using  the  coefficients  of  its  coefficient  matrices.  It  is  proved 
that  the  distance  is  proportional  to  the  mean  distance  of 
the  ISI  channel  output  symbols,  which  controls  the  perfor¬ 
mance  in  resisting  additive  channel  random  noise.  Thus, 
an  AR  precoder  is  optimal  if  and  only  if  it  has  the  largest 
distance.  We  then  characterize  all  optimal  systematic  AR 
precoders,  where  all  systematic  AR  precoders  are  charac¬ 
terized  in  [4-5].  A  necessary  and  sufficient  condition  for  an 
AR  precoder  to  be  optimal  is  given,  which  is  easy  to  check. 
The  optimality  is  channel  independent.  Finally,  Numerical 
examples  are  presented  to  illustrate  the  theory. 

2  Ambiguity  Resistant  Precoders  via 
ISI  Cancellation 

A  precoded  single  receiver  system  and  undersampiea 
antenna  array  receiver  system  are  shown  in  Fig.  1  and  Fig. 
2  respectively,  where  G(*)  in  Fig.  1  and  G (*)  in  Fig.  2 
a ie  precoders,  H(z),  Hi(z), ....  Hu{z)  are  the  ISI  channe 
transfer  functions,  and  all  of  them  are  either  polynomial 
matrices  or  polynomials  of  the  delay  variable  i  '.In  what 
follows,  boldface  captial  English  letters  denote  polynomial 
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Figure  1:  Single  Antenna  Receiver  with  Baud  Sam¬ 
pling  Rate. 
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be  =  N  -  1  given  N  in  an  AR  precoder. 
let  G(z)  in  Fig.  I  take  the  following  form 


Figure  2.  An  Undersampled  Antenna  Array  Receiver 
System. 

Cinre  the  two  systems  in  Fig-  1  and  Fig.  2  can  be  con 
JSco  two  multi-input  mult.-ou.pu.  (MIMO)  QW** 
the  existing  M1MO  system  identification  ^iqueS-  *** 
for  SLple  19).  can  be  used.  However,  based  on  the* 
results  on  MIMO  system  identification,  one  can  almost 
identify  an  MIMO  system  to  a  constant  matrix  ambiguity. 
t  ordL  to  further  resist  the  constant  matrix  amb.gu.ty 
induced  from  an  MIMO  system  identification  algorithm, 
ambiguity  resistant  precoding  has  been  introduced  n  ^ [2 
3).  A  precoder  G(z)  of  size  S  x  K  is  called  ambiguity 

rtsutant  (AR)  if 

(i)  G(z)  is  irreducible,  i.e.,  matrix  G(z)  has  full  rank  for 
all  complex  values  z  including  z  -  oo, 

(ii)  the  following  «,»..»»  to  K  *  *  J  “ 

trix  V(z)  has  only  trivial  solution  V(z)  -  °  K 

a  nonzero  constant  a: 

£G(z)  =  G(z)V(z),  (21) 

where  E  is  an  N  x  N  nonzero  constant  matrix  and 
/K  is  the  K  x  K  identity  matrix. 

It  has  been  shown  in  [2]  that  G(z)  is  AR  implies  K  <N. 

In  other  words,  the  precoding  has  to  ^pand  ^  ^  ^- 

ples  into  S  samples.  This  is  intuitively  dear  that  cer 


,.[  l 

[  0(M_W)*W  J 


.  U  ^  N  and  G(z)  is  an  JV  x  K  polynomial  matrix. 
Uhas  been  proved  in  (22-23]  that,  if  the  precoders  in  Figs^ 

1-2  take  the  above  forms  and  G(z)  are  AR,  t< Jn it  e |  mpu 
signals  in  the  systems  in  Figs.  1-2  can  be  bhndly  identi- 
fiS  from  the  output  signals,  where  the  ISI 1  ^ 

H,(z\  Hu(z)  may  have  spectrum-null.  ln  (2-5],  fam 

^  rf  ARpldmbi.  b*.  obutood  J»  M.  ■“» 

closed-form  blind  identification  algorithms  have  also  been 

°btSeeAR  precoders  have  been  generalized  in  [3]  to  poly 
J£,  ambiguity  resistant  (PAR)  precoders  for  resisting 
constant  matrix  ambiguities  but  also  Hy-und 
matrix  ambiguities.  The  main  advantage  of  using  PAR 
nrecoders  in  the  systems  in  Fip.  1-2  is  that  one  can  - 
rectly  identify  the  input  signals  from  the  output  sipials 
by  reolving  the  channel  polynomial  ambiguities  ^bou 
uLg  any  MIMO  system  identification  algorithm.  In  the 
rest  of  this  paper,  for  simplicity  we  however 
precoders  although  an  analogous  approach  apphes  to  PAR 

precoders. 

3  Optimal  Ambiguity  Resistant  Pre- 

AhhoughrSall  AR  precoders  found  to  (W)  are  good 
enough  in  theory  to  be  used  to  cancel  the  ISI  ^‘bout^- 
ditive  noise,  AR  precoders  may  have  performance  differ¬ 
ence  when  there  is  additive  noise  in  the  cbannel^Then  the 
question  becomes  which  AR  precoder  is  “better,  wh 
•better*  means  better  symbol  error  rate  performance  at 
the  receiver  after  equalization.  In  this  section,  we  study  a 
criterion  for  AR  precoders  and  also  optimal  AR ^precoders 
by  introducing  the  distance  concept  for  a  precoder. 

3.1  Distance  and  Criterion  for  AR  Pre- 
coders 

To  study  the  above  question,  let  us  briefly  recall  the 
conventional  error  control  coding  theory.  In  error  consol 
..  innuts  code  coefficients  and  outputs  are  all  in  a 

f^  sui  »  0  -d  1.  »d  the  coding  « 

the  finite  field  arithmetic.  Therefore,  the  Hamming 
.wo  Bnite  *****  <*  * 

used.  Moreover,  the  minimum  distance  between  all  coded 
sequencies  can  be  calculated  from  the  code  itself.  Th 
imurn  distance  controls  the  performance  of  the 
st  the  receiver  for  ffifferences  here 

^"he  input,  precoder  coefficients  and  outputs 
S5  in^he  complex-valued  field  and  then  the  channel 
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has  ISI  besides  additive  random  noise.  Although  this  is 
the  case  the  “distance"  of  the  ISI  channel  output  values 
also  controls  the  performance  in  resisting  additive  channel 
random  noise.  To  the  first  issue  the  conventional  Ha^- 
ming  distance  does  not  apply  here  and  the  Euclidean ^dis¬ 
tance  for  the  output  signal  values  after  precoding  needs  to 
be  used.  Since  it  is  hard  to  deal  with  the  minimum  Eu- 
clidean  distance  concept  in  the  complex-valued  field  the 
Euclidean  distance  here  is  in  the  mean  sense  when  the  m- 
put  signal  is  modeled  as  a  complex-valued  random  proce*. 
To  the  second  issue,  we  need  to  investigate  how  the  Eu¬ 
clidean  distance  of  the  output  values  of  a  precoder  affects 
the  Euclidean  distance  of  the  output  values  of  the  ISI  chan- 
tb«  p«rfo.m»c«  »f  the  p-cota 

resisting  additive  random  errors. 

To  study  these  issues,  let  us  go  back  to  the  systems  with 
ISI  in  Figs  1-2.  By  blocking  the  ISI  channels  from  serial 
th.  system,  in  Figs.  M  a.  b. 
one  shown  in  Fig.  3,  where  X(z)  is  the  K  x  1  poty™™* 
matrix  of  the  z-transform  of  the  input  vectors,  G(z)  is  the 
N  x  K  AR  precoder,  H(z)  is  the  M  x  N  polynomial  matrix 
of  the  ISI  channel,  »»(*)  is  the  Af  x  1  polynomial  matrix 
of  the  z-transform  of  the  additive  white  no.se  vectors  and 
Y  (2)  is  the  M  x  1  polynomial  matrix  of  the  z-transform 
of  the  channel  output  vectors. 


let  us  use  matrix  representations  for  linear  transforms 
tions  By  concatenating  all  vectors  X(»)  together  all  vec¬ 
tors  V(n)  together,  all  vectors  U(n)  together,  all  vectors 
,(„)  together,  and  all  vectors  Y( n)  together,  we  obtain 

bigger  block  vectors  X  =  (x(n)),  V  -  (u(n)),  U  -  W")), 
,  =  (q(n)),  and  y  =  (y(n)).  respectively.  Let  Q  and  U 

denote  the  generalized  Sylvester  matrices,  respectively: 


G(Qc) 


G(Qc) 


H(Qh)  •••  H(0) 


...  h(Qh)  ■■■  m 


•  (3-1) 


Figure  3:  Unified  System. 


G(z)  =  53  H(2)  =  £ 

n-0  nm0 

X(z)  =  53  X(n)*-.  Y(z)  =  53  Y(n)z-. 

.  " 

Ut  the  z-transform  of  the  precoder  output  vector  sequence 
V(z)  =  G(z)X(z)  =  53 

n 

and  the  z-transform  of  the  ISI  channel  output  vector  se- 
quence  be 

U(z)  =  H(z)V(z)  =  53t/(n)z""- 

n 

Notice  that  all  X(n),  Y(n),  V(n),  V(n),r,(n)  are  constant 
column  vectors  while  G(n),H(n)  are  constant  matrices. 
To  study  the  mean  distance  for  the  output  values  in  l/(n), 


Tten’  V  =  QX,  U  —  WV,  y  =  U  +  t) .  (3.2) 

In  what  follows,  for  convenience  we  assume  the  input 
signal  x(n)  is  an  i.i.d.  random  process  with  mean  zero 
aS  variance  a\.  Thus,  random  processes  u(n)  and  u(n) 
have  mean  zero.  We  also  assume  all  coefficients  in  the 
ISI  channel  H(z)  are  i.i.d.  with  mean  zero  &nd  jvanance 
ah  and  they  are  independent  of  x(n).  Notice  that 
assumption  is  only  used  to  simplify  the  following  analysis 
and  it  does  not  apply  to  the  single  receiver  ^temmF.g 
1,  where  the  corresponding  channel  matrix  H(z) 

p’seudo-circulant  structure  [10].  ., 

The  mean  distances  between  all  values  of  u(n)  and  all 
values  of  v(n)  are 

d,=  ( E(53  |t»(m)  -  v(n)|,)>j  , 

\  • 

d,  =  (e(5>M  -  «(n)lJA  >  (3-3* 

resoectively  where  E  means  the  expectation.  By  the  as- 
S5SS-  the  coefficients  of  H(z),  it  is  not  hard  to  see 

the  following  relationship  between  the  mean  distance  d  of 

the  ISI  channel  output  values  u(n)  and  the  mean  distance 
d.  of  the  precoder  output  values  (or  the  ISI  channel  input 

values)  v(n):  /,  4\ 

dn  =  and,. 

This  implies  that  the  performance  of  a  precoder  in  resisting 
additive  channel  white  noise  is  proportional  to  the  mean 
distance  of  the  precoder  output  values.  This  result  sol  es 
the  second  issue  arised  in  the  beginning  m  this  section  and 
we  only  need  to  study  the  mean  distance  d.  of  all  the  pre¬ 
coder  output  values  for  the  performance  of  resisting  addi¬ 
tive  channel  random  errors.  Based  on  the  above  analysis, 
we  have  the  following  definition  for  optimal  AR  precoders. 

Definition  1  An  N  x  K  ambiguity  resistant  precoder 
G(z)  is  called  optimal  if  the  mean  distance  d.  of  «« 
precoder  output  values  is  the  maximal  among  alINxK  am¬ 
biguity  resistant  precoders,  when  the  total  energy  is  fixed. 
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The  squared  mean  distance  d,  can  be  calculated  as 
dj  -  7:  Eju(m)  -  u(n)|s 

m.n 

=  2 (LS  -  D  T  E(|t>(n)|?)  -  2  £  E(u(m)v‘  (n)),  (3  5) 

a  m*n 

where  L  is  the  length  of  the  precoder  outpul t  vector  se¬ 
quence  V(n)  and  N  is  the  precoder  sue.  Let  R(mn)\ 
the  correlation  function  of  the  random  process  v(n),  ..e., 

R(m,n)  =  E(v(m)v‘(n)). 

Let  R  be  the  correlation  matrix  of  v{n),  i.e., 

*  =  (R(m,n))  .  E  («*«W)  =  mxX)'«'~‘W 

where  f  means  the  conjugate  transpose.  One  ““  "*  th" 
the  first  term  and  the  second  term  in  the  right  hand  si 
1m  s  for  .he  .tonne,  d,  the  »»  «i  f 
Imee  J  he.,  .he  of  .he  mn.rb.  if  «*?*** 
2„.  ud  the  sun.  of  ell  the  off  diaso"*1  *he 

matrix  US'  »«!.*«  *  toj.  'TTL?,I 
the  squared  mean  distance  d.  can  be  calculated 

dl  =  2o\  -  l)trace(5£* 


;VS>sV.) 

m  ^  n  / 

s  2 o\  (  LA'tracetCff1)  -  j  > 

V  m,n  / 


(3.7) 


where  (CS1)-"-  denotes  the  element  at  the  mth  row  and 

the  nth  column  of  QQ f.  „  .  „  .  %  A 

We  next  want  to  simplify  d»  in  (3  7)  ^  usl“8 
coefficients  in  the  precoder  G (z).  For  a  precoder  G(«), 

define 

DC  =  sum  of  all  coefficients  of  all  coefficient 

matrices  of  G(z)G*(l/z),  (3.8) 

Ec  I  sum  of  all  magnitude  squared  coefficients  of 

all  coefficient  matrices  of  G(z),  (3  9) 

where  G’  means  the  conjugate  transpose  of  all  coefficient 
matrices  of  G(z).  Let  L  be  the  length  of  the  precoder 
output  vector  sequence  V(n).  Then,  by  (3.  ), 
hard  to  sec  that 

tra ce((?C*)  =  LEc,  and  ^(C^’)"*-  =  LDc  (3  10) 

tn,n 

Therefore,  riiit 

dl  m  2 <tIL(LNEg  -  Dc)  (3  n> 

Since  EC  is  fixed  as  the  total  energy  of  all  the  coeffi^ 
cients  of  the  coefficient  matrices  in  G(z),  and  <r,,  L,  an 
S  are  also  fixed,  based  on  formula  (3.11)  for  the  mean  dis¬ 
tance  <£.,  we  have  the  following  criterion  for  judging  the 
performance  of  an  AR  precoder. 


Definition  iNxff  ambiguity  mutant  prec«ier  G(z)  « 
joid  better  than  N  *  K  ambiguity  mutant 
if  Dc  <  Dr  whcn  Eg  =  where  Do,  Df*  G’ 
arc  defined  by  (3.8)-(3.9)  for  prtcoders  G(z)  and  F(z), 

respectively. 

Based  on  formula  (3.11)  on  the  mean  distance  dv  of 
the^recoder  output  values,  we  define  the  distance  for 
precoder  as  follows. 

Definition  3  For  «  M  «  K  precede  GW,  .«  <*««■« 
is  defined  by  _ 

where  Dc  and  Ec  are  defined  in  (3.8)- (3. 9). 

With  the  above  two  definitions  the  following  lemma  is 

straightforward. 

Lemma  1  AR  prtcoder  G(z)  is  fetter  than  AR  precoder 
F(z)  if  and  only  if  the  distance  ofG(z)  u  greater  than  the 
distance  o/F(z),  «•«•.  d(G)  >  d(F). 

Since  the  precoder  output  vector  length  L,  the >  preeoder 

size  N,  and  the  input  ' 

following  theorem  is  straightforward  from  (311). 

Theorem  1  An  N*K  ambiguity  mutant  precoder  G{z) 
is  optimal  in  all  N  x  K  ambiguity  mutant  precoders  if 
and  only  if  the  total  sum  Dc  of  all  the  coefficients  of  M 
the  coefficient  matrices  of  the  product  matrix  G(z)G  (1/z) 
t  ZTmal  among  all  possible  N  x  K  ambiguity  mutant 
precoders  F  (z)  when  the  total  sum  Ec  of  all  the  majnjd 
squared  coefficients  of  all  coefficient  matrices  of  F(z)  u 

fixed . 

Notice  that 

olLDc  =  «\  =  X)E(u(m)w’(n)) 


»eE»(")  *°- 


(3.12) 


Using  (3.11),  the  following  upper  bound  for  the  mean  dis- 
tance  dv  is  proved. 

Theorem  2  The  mean  distance  d,  of  the  precodtr  output 
for  anNxK  precoder  G(z)  is  upper  bounded  by 

d .  <  o.Ls/2NV&3,  (313) 

where  o\  is  the  input  signal  variance,  Lis  tl 

the  precoder  output  vector  sequence,  and  Ec 
(3.9hTe.,  the  total  energy  of  all  coefficients  in  G (z).  The 
upper  bound  for  the  distance  of  an  N  *  K  precoder  G (*) 
isd(G)<N. 


i 
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3.2  Optimal  Systematic  AR  Precoders 


In  this  subsection,  we  determine  all  optimal  systematic 
AR  precoders  with  the  form: 


1 

0 

0 

0 

1 

0 

0 

0 

1 

.  FiW 

F2(z) 

Fn- i(z)  . 

by  using  the  above  criterion.  We  have  the  following  result. 

Theorem  3  An  N  x  (N  -  1)  systematic  ambiguity  resis¬ 
tant  precoder  F(z)  tn  (3.14)  twdk 

Fk(z)  =  Y  <»*'*"'.  *  0,  1  <  *  <  AT  -  1,  (3.15) 

/=o 

for  m  >  n2  >  >  ns- 1  >  1,  u  optimal  if  and  only  if 

"k 

at/  =  -1,  /or  fc  =  1,2,...,  N  -  1.  (3.16) 

1*0 

Moreover,  for  the  above  optimal  precoder,  the  mean  dis¬ 
tance  dv  of  the  precoder  output  values  and  the  precoder 
distance  d(F)  are 

d ¥  =  ozL\f7NVE?,  and  d{F)  =  AT,  (3.17) 

where  o\  is  the  variance  of  the  input  signal,  L  u  the  length 
of  the  precoder  output  vector  sequence  and 

N- 1  «k 

Ef  =  N  -  1  +  Y  11,  ,a4',a-  <3-18) 

k=l  1=0 

This  theorem  also  implies  that  there  exist  AR  precoders 
that  reach  the  upper  bound  (3.13),  i.e.,  Dc  =  0. 

4  Simulation  Results  and  Conclusion 

Some  simulation  results  with  5  different  AR  precoders 
with  different  distances  are  shown  in  Fig.  4. 

In  this  paper,  we  introduced  the  concepts  of  precoder 
distance  and  optima/  AR  precoders  in  justifying  an  AR 
precoder.  Given  an  N  x  K  precoder  G(z),  its  distance 
is  defined  by  d{G)  =  N  -  £>c/£c,  where  Dg  is  the  to¬ 
tal  sum  of  all  coefficients  of  all  coefficient  matrices  of  the 
matrix  G(z)Gt(l/r)  and  Eg  is  the  total  sum  of  all  mag¬ 
nitude  squared  coefficients  of  all  coefficient  matrices  of  the 
matrix  G(z).  With  this  distance  definition,  an  N  x  K 
AR  precoder  is  optimal  if  and  only  if  its  distance  is  N. 
Furthermore,  we  characterized  all  N  x  (N  —  1)  optimal 
systematic  AR  precoders.  With  this  characterization,  one 
is  able  to  construct  all  possible  optimal  N  x  (N  —  1)  sys¬ 
tematic  AR  precoders.  Finally,  numerical  simulations  were 
presented  to  illustrate  the  theory  and  the  concepts.  Our 
numerical  examples  showed  that  an  optimal  AR  precoder 
has  good  performance  in  resisting  both  of  the  channel  ISI 
and  additive  random  noise. 


Figure  4:  Symbol  Error  Rate  Comparison:  Solid  line 
with  *  is  for  Gi(z);  Solid  line  with  +  is  for  G2(z); 
Solid  line  with  o  is  for  G3(z);  Dashed  line  with  x  is 
for  G4(z);  Solid  line  with  x  is  for  G3(z).  d(Gi)  = 
d{ G2)  =  2,  d(G4)  =  <f(G5)  =  4,  d(G3)  «  1.0858. 
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